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The  analysis  of  variance  is  a  comironly  used  statistical  method  by  which 
estimates  of  a  number  of  variances  are  made  and  by  which  the  significance  of 
the  differences  between  these  estimates  is  determined.    More  recently  the  var- 
iances have  been  used  as  the  basis  for  estimation  of  variance  components.  The 
arithmetic  procedures  involved  are  straightforward  when  the  data  are  based  on 
the  same  frequency  in  each  subclass. 

On  the  other  hand,  unequal  cell  frequencies  are  not  uncommon  in  some 
t:^a^es  of  exoerimentation.    This  is  particularly  true  in  animal-breeding  ex- 
periments concerned  with  litter  size  and  sex.    In  addition,  animals  are  fre- 
quently lost  ,during  the  experimental  oeriod.    As  a  consequence,  the  ideal  of 
equal  numbers  in  each  subclass  or  classfication  is  seldom  attained. 

The  relation  of  least-squares  procedures  to  the  analvsis  of  variance  in 
recent  years  has  been  emohasizBd  by  many  authors.    For  data  conforming  to  the 
conventional  experimental  designs,  short-cut  procedures  are  well  known  which 
make  the  least- squares  method  unnecessary »    However,  it  is  only  recently  that 
the  power  of  least-squares  methods  for  the  analysis  of  data  with  unequal  sub- 
class frequencies  has  been  recognized. 

The  method  of  fitting  constants  (least-squares  analysis)  is  presented  in 
this  article  as  related  to  (1)  the  one-wa--^  classification,  (2)  one-way  class- 
ification with  regression  or  covariance,  (3)  two-way  classification  without 
interaction,  (h)  two-wav  classification  with  interaction,  (5)  multiple  and 
nested  classifications,  and  (6)  soecific  anim.al  crossbreeding  experiments. 

The  least-squares  methods  herein  discussed,  while  related  specifically 
to  animal  breeding  data,  are  equally  applicable  to  data  with  unequal  subclass 
frequencies  from,  any  other  area  of  research. 

Copies  of  this  corrected  version  may  be  obtained  from  Data 
Systems  Application  Division   (DSAD) ,  Aaricultural  Research  Service, 
U.S.  Department  of  Agriculture,  Room  013,  NAL  Building,  Beltsville, 
Maryland  20705. 


Judson  0.   McGuire,  Jr. 
Staff  Specialist,  DSAD-ARS 
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LEAST-SQUARES  ANALYSIS  OF  DATA  WITH 
UNEQUAL  SUBCLASS  NUMBERS 

Walter  R«  Harvey 


INTRODUCTION 

Disproportionate  subclass  frequencies  al-rays  cause  the  different  classes 
of  effects  to  be  non-orthogonal.    This  means  that  the  different  kinds  of 
effects s  such  as  years,  sex,  and  age  of  dam,  cannot  be  separated  directly 
wi-thout  entangleinent#    In  order  to  free  these  effects  from  the  entanglement 
or  confounding,  it  is  necessary  to  resort  to  siirai-ltaneous  consideration  of  all 
effects.    With  equal  subclass  frequencies  the  effects  and  suras  of  squares  for 
tests  of  significance  are  obtained  directly  from  the  class  or  subclass  totals 
which  is  equivalent  to  the  simultaneous  consideration  of  all  effects.  In 
this  case  the  design  is  said  to  be  balanced  and  the  effects  are  all  mutually 
orthogonal* 

All  statistics  which  can  be  coKiputed  from  data  with  equal  subclass  fre« 
quencies  can  also  be  computed  from  data  with  unequal  subclass  frequencies  by 
use  of  appropriate  statistical  methods  which  are  presently  aYailable,  For 
example,  with  fixed  effects  the  investigator  is  interested  in  the  computation 
of  means,  regression  eoefficientsj  standard  errors,  tests  of  significance^ 
orthogonal  comparisons interactions  and  mean  separation  procedures?  whereas, 
with  random  effects  one  is  interested  in  the  estimation  of  variance  components 
andwests  of  significance.    Methods  of  computing  these  statistics  from  data 
with  equal  subclass  frequencies  are  well  known  and  are  given  in  most  standard 
textbooks  for  different  designs*    It  is  the  purpose  of  this  presentation  to 
give  presently  available  orocedures  for  computing  all  these  statistics  from 
data  with  unequal  subclass  numbers  in  considerable  detail.    The  methods  are 
presented  by  beginning  with  the  simplest  models  and  progressing  to  the  more 
complex  models  that  are  so  often  encountered  in  the  analysis  of  animal- 
breeding  data. 


1/    Biom.etrician ,   In  Charge,  of  the  then  Livestock  Research  Staff,  

Biometricol  Services,  ARS ,  USDA,   Aaricultural  Research  Center, 
Beltsville,  f'aryland  —  presentlv  Professor,  Department  of  Dairy  ^^-^ 
Science,  Ohio  State  University,  Colnmbus ,  Ohio. 

The  writer  wishes  to  express  appreciation  to  the  professional  staff 
of  Biometrical  Services  for  their  suggestions  in  the  preparation  of  the 
manuscript.    Special  acknowledgement  is  given  Dr.  R.  A.  Damon,  Jr., 
Biometrician  in  the  Livestock  Research  Staff,  for  writing  the  section 
concerning  the  analysis  of  crossline  data  and  for  assistance  in  the 
development  of  several  short-cut  methods  presented. 
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LEAST-SQUARES  PRINCIPLES  IN  THE  ONE»WAY  CLASSIFICATION 


Methods  of  analyzing  data  classified  in  only  one-way  are  straight- 
forward even  though  unequal  nuufcers  exist  froE?  class-to-class.    It  is  in- 
tended to  demonstrate  here  that  analysis  ty  the  method  of  fitting  constants 5 
in  >diich  direct  matrix  arithmetic  is  involvedj,  yields  the  same  results  as  the 
standard  methods  of  computations,  thereby  clarifying  the  meaning  of  least- 
squares  analysis. 


tfodel 

In  any  analjrsis  it  is  important  that  the  mathematical  iriodel  underlying 
the  analysis  and  the  assumptions  made  in  using  the  model  be  well  known.  In 
the  one-way  classification  the  model  iss 

yi^  -     +       +  e^^^ 

i  »  1,  2,  p 
j  »  1,  2,  ni. 

where  s 

y^j  «  the  j'''^  observation  in  the  i"*^^  A  class »    (The  A  classes  may 
represent  agesj  or  years,  or  sires,  or  treatments,  etc), 

\x  =  the  overall  population  mean  when  equal  frequencies  exist 
among  the  classes  of  A, 

aj^  »  the  effect  of  the  i"^^  A  class  expressed  as  a  deviation  from 
the  overall  mean 

"  random  errors  which  are  assumed  to  be  independent.    If  tests 
of  significajice  are  to  be  made  then  it  is  essential  to  assume 
that  the  e^j  are  also  normally  distributed,  i.e,,  N(0,ct|), 

Each  of  the  terms  on  the  right  hand  side  of  the  equation  for  the  model 
are  population  parameters  or  constants.    Estimates  of  these  parameters  ob- 
tained in  the  analysis  are  designated  by  placing  a  circumflex  (^)  over  the 
letter,  e.g.,  ^  is  the  estimate  of  ^i,  a^  is  the  estimate  of       and  Oq  is  the 
estimate  of  a|.    If  the  &±  are  random  effects  then  the  variance  due  to  the 
a^,  cl,  is  estimated  by  al. 

Least-Squares  Equations 

The  least-squares  normal  equations  result  from  the  use  of  a  differential 
calculus  principle.    However,  is  is  unnecessary  to  actually  go  through  the 
differential  calculus  manipulations  in  order  to  write  out  tHe  equations, 
since  they  follow  a  systematic  pattern*    The  first  point  to  remember  in  the 
construction  of  this  set  of  equations  is  that  there  must  be  one  equation  for 
each  of  the  constants  to  be  estimated.    The  eqiiations  for  the  one-way 
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classification  are  as  fcllojsi 


\L"    Bo        n-,i-,      rigag  +  ...  +  ^^p^p  *  ^» 


In  this  notatioHj  the  letter  and  appropriate  subscript  on  the  left  followed  by 
a  colon  denotes  the  equation*    The  second  subscript  is  oiTdtted  in  the  n^s  and 
in  the  Y's  since  this  subscript  is  always  summed  over.    However,  a  dot  in  the 
subscript  indi.cates  that  summation  has  been  made  over  that  subscript,  e,g,, 
n.  '  Z  n±  where       "  the  nuniber  of  observations  in  the  ith  class.    The  coluian 

of  values  on  the  right  hand  side  of  the  equal  sign  is  referred  to  as  the 
right  hand  member  (RHM)»    Very  often  the  same  variance-covariance  matrix  is 
used  with  several  right  hand  members. 

The  least-squares  equations  may  be  represented  in  tabular  form  as 
follows  t 


p. 

^2    -  - 

-  - 

RHM 

^1 

n2    -  . 

% 

^1 

^1 

0    ^  . 

.  -  0 

^1 

0 

ng    -  • 

«  -  0 

^2 

Hp 

0 

0    -  . 

Four  features  of  least-squares  equations  should  be  noted  from  this 
table.    First,  the  elements  of  the  varlance-covariance  mati*iy  (the  left  hand 
members)  form  a  matrix  symmetrical  about  the  main  diagonal,  i.e,,  the  ele- 
ments on  the  right  hand  side  of  the  diagonal  which  runs  from  left  to  right 
are  a  mirror  image  of  those  on  the  left  side  of  these  diagonal  elements.  If 
the  rows  and  columns  are  numbered  1,  2,  ,,,  p  +  1  and  the  first  number  is 
used  to  identify  a  given  row  and  the  second  niimber  to  identify  a  column,  then 
the  element  in  the  12  position  is  the  same  as  the  element  in  the  21  position, 
the  element  in  the  23  position  is  the  same  as  the  element  in  the  32  position, 
etc.    For  the  problem  under  consideration,  elements  in  the  \i  equation,  as  the 
coefficients  for  the  a^^,  are  the  same  elements  as  the  diagonal  elements  in 
the  ai  equations.    Tliirdly,  the  off  diagonal  elements  in  the  section  of  rows 
and  columns  for  the  ai  are  all  zero.    The  fourth  feature  that  shoiild  be  noted 
is  that  the  sum  of  the  coefficients  for  the  L  in  the  \l  equation  is  equal  to 
the  coefficient  for  JI  in  the  same  equation.    Likewise,  the  sum  of  the  right 


hand  members  for  the       equations  is  equal  to  the  right  hand  member  for  the  ji 
equation.    All  these  features  are  characteristic  of  original  least-squares 
<J,quations    in  general,  where  constants  are  fitted  for  classifications « 

The  least-squares  equations  for  the  one-way  classification  may  be  shown 
in  a  more  reduced  form  as,. follows? 

\Lt    n,.  \i  +  Z  =  Y, 

In  tabular  form,  these  may  be  represented  as  follows s 


il 

ai 

RHM 

n. 

"i 

Y, 

Hi 

O^i^ 

where  the  zeros  in  the  ^ts%  section  indicate  that  the  off -diagonal  elements 
in  this  section  are  equal  to  aero. 


Imposing  Restrictions 

As  noted  above,  the  sum  of  the  coefficients  for  the  aj  equals  the  co- 
efficient for  p.  in  the  \i  equation  and  the  suui  of  the  RHM^g  for  the  equations 
equals  the  RHM  for  the  \i  equation*    Since  this  is  true  one  cannot  solve  the 
equations  until  some  restriction  (or  constraint)  has  been  imposed  on  the  con- 
stants®   Actually'"  there  are  only  p  independent  equations  so  that  an  additional 
relationship  imst  be  introduced 5  generally  known  as  a  constraint  or  re- 
striction, in  order  to  obtain  a  solution.    There  are  an  infinite  nuntoer  of 
ways  in  Ti^ch  this  can  be  done  from  a  raatherr&tical  viewpoint.    However,  only 
two  of  these  are  in  common  use  today  and  they  will  be  considered  here. 

A  direct  method  of  obtaining  estimates  of  the  ai  as  deviations^frora  \i 
is  to  impose  the  restriction  on  the  least-squares  equations  that  S  a^^  «  0* 

When  this  is  done  the  coefficients  of  one  of  the  &±f  say  Sp,  must  be  sub- 
tracted from  the  coefficients  of  the  other  ai.    The  resulting  elements  in  the 
ap  equation  are  then  subtracted  from  the  corresponding  elements  in  the  other 
aj_  equations  to  maintain  symmetry  in  the  equations.    In  addition,  the  RHM  for 
the  ap  equation  is  subtracted  from  the  RHM's  for  the  other  ai  equations.  Whea 
these  subtractions  are  completed  the  column  and  rojs  for  the  ap  equation  is 
deleted  and  the  resulting  symraetrical  set  of  equations  are  solved  to  obtain 
estimates  of  pi  and  the       directly^    After  subtraction  p  equations  remain, 
i.e.,  p-1  for  the  a^^'s  and  one  for  p.c. 

One  of  the  simplest  restrictions  (or  constraints)  one  could  impose  on  the 
aj_  is  to  assume  that  one  of  the  a|     0,  say  ap,  and  simply  delete  the  ap 
equation  and  the  column  of  coefficients  for  ap„    When  this  is  done  and  the 


remaining  equations  are  solved  to  obtain  estimates  of  the  unknowns  the 
.following  linear  functions  of  the  parameters  are  estimated: 

J'  =^  jx  +  sip 
a{    =       »  ap 
a^    -  ag  -  ^ 

^-p-1       ^p„i  -  ap 
-  0 

P 

In  the  model  the  a^  are  expressed  as  deviations  from  \l.  Hence,  it  is  logical 
to  assume  that  Z  a^^  »  0  and  obtain  the  constant  estimates  as  deviations  from 

ji.    Of  course,  the  iTiagnitude  and  sign  of  the  differences  among  the  a'^  and 
among  the       are  the  same.    After  obtaining  estimates  of  the  a^  one  can  com- 
pute estimates  of  the  sl±  by  forcing  the  sum  of  the  a^  to  equal  zero  since 

"I    a|/p  =  ap.    The  overall  mean,  ^,  can  then  be  separated  from  a_  and  the 
i«l  ^  ^      ^  ^ 

least  squares  me&ns,  p,  +  a.^,  can  be  computed. 

The  restriction  that  the  sum  of  the  constant  estimates  within  a  given 
set  sum  to  zero  is  preferred  to  the  restriction  that  one  of  the  constants  for 
each  set  is  equal  to  zero.    In  addition  to  the  advantage  of  obtaining  the  de- 
sired constant  estinates  directly  there  are  other  important  reasons  for  this 
preference  that  will  be  explained  later. 


Utilization  of  the  Matrix  Inverse 

Numerous  methods  exist  for  obtaining  the  solution  of  a  set  of  simul- 
taneous equations  and  for  obtaining  the  inverse  of  a  matrix.    None  of  these 
will  be  discussed  here.    Instead,  the  use  to  be  made  of  the  inverse  elements 
in  analysis  of  data  will  be  discussed  in  considerable  detail.    Some  of  the 
uses  made  of  the  inverse  elements  are  discussed  below. 

Computing  Estimates  of  the  Constants 

Although  the  constant  estimates  may  be  obtained  by  direct  solution  of 
the  equations,  often  it  is  more  convenient  to  obtain  them  from  the  inverse 
elements  and  the  RHM's  of  the  equations,  since 

Z  Ci^       -  Ci 

where  C^^  is  the  inverse  element  for  the  i"^'^  row  and  J"^^  column  of  the  com- 
plete inverse  matrix,       is  the  RHM  for  the  j'th  row  and  ci  is  the  ith 
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constant  estimate.    For  example. 


-  c21y.  *  c22(i^.Yp)  *  c23(Y2„Yp)  ^  ...  +  C^PCYp^^-Yp) 
etc. 

If  a  method  of  inversion  and  solution  is  used  so  that  both  the  inverse 
elements  and  the  constant  estimates  are  obtained  directlsr,  then  an  indirect 
check  on  the  accumulation  of  rounding  errors  is  available  by  also  computing 
the  constant  estimates  by  this  method  and  checking  them  agiinst  those  ob- 
tained directly. 


Standard  Errors  of  Constant  Estimates  or  Linear  Functions  of  Constant  Estimates 
The  standard  error  of  an  estimated  constant,  such  as  ^  or  s^,  is 

-  /"^r^i 

where  Cii  is  the  corresponding  diagonal  inverse  element  for  that  constant  and 

a|  ■  —i—  fzz  jfi  -  R(iJ.,aj)]   for  the  one-way  classification.    The  reduction 
n.-p  [ij     ^  J 

due  to  fit'iing  all  constants,   .R(|.  ,3^),,  may  be  computed  from  all  constant  es- 
timates and  the  original  RHM's,  i»e., 

R(p.,a5)  »  jl  Y«  +  aiYi  +  a2Y2  +        +  sipYp 

when  the  restriction  is  imposed  that  S  a^^  «  0,  since  ?  j  *  ^f"^  a±.    The  re- 

i  '  i«l 

duction  in  sum  of  squares  due  to  fitting  all  constants  may  be  computed  more 
easily  from  the  constant  estimates  and  the  reduced  RHM's.,  i.e. 

R(M.,ai)  »  5.  Y.  +  ai(li»Yp)  +  ...  Vl^Vl""'^P^* 

The  reduction  in  sum  of  squares  due  to  fitting  all  constants  is  the  same 
when  the  ap  equation  is  deleted  by  row  and  coIumi,  i.e., 

R(}i,a3)  =  ii'Y.  +  a^Y^  ^  ajYg  +        +  ap.i  Yp.i 

However,  when  interac^ions  ^re  included  in  the  sjodel  difficulties  arise  in 
imposing  the  restriction  that  certain  effects  equal  zero.    These  difficulties 
are  discussed  in  the  section  on  the  two-way  classification  with  interaction. 

The  standard  error  of  a  difference  between  two  constant  estimates  is 
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The  same  standard  error  for  a  difference  between  two  constants  is  obtained  re- 
gardless of  the  restriction  intposed  on  the  constant  estiinates.    When  the  re- 
striction was  imposed  that  ap  =»  0,  the  inverse  sleinents  for  the  ap  column  and 
row  all  equal  zero,  but  when  the  restriction  was  imposed  that  2  a^^  »  0  the  in- 
verse elesnents  for  the  Sp-colum  and-row  ar©  coisiputed  from  the  dependencies 
which  ca-ase  the  inverse  eleinents  within  set  to  sum  to  zero  by  rows  and 
columns.    For  axample^ 

cKp-^i)  ^-ici^     c^3  ^  clh  .       ^  C^P) 
C2(P+1)  »  ..(C22  +  C23  +  C2ii  4         +  C2p) 

c(p*l)(p+l)  „»(cl(P'«-l)  ^-  C2(P+1)  *  C3(P+1)  ^        ^  CP(P-^^) 

Hie  standard  errors  of  the  least-squares  means  are  obtained  as  follows? 


-     ^^11  +  di  +  2Cli)  o% 

Regardless  of  the  restriction  imposed,  this  standard  error  will  be  the  same, 
but  only  in  the  case  of  the  one-way  classification.    With  multiple  c?.assi- 
fications,  this  standard  error  is  difficult  to  obtain  if  a  coluuai  and  row  are 
deleted  in  each  set  of  constants.    With  the  restriction  that  the  sum  of  the 
constant  estimates  sum  to  zero  within  each  set,  the  least-squares  means  and 
their  standard  errors  are  easy  to  compute  from  the  constants  and  the  inverse 
elements » 


Computing  Sums  of  Squares  for  the  Analysis  of  Variance 

The  sum  of  squares  for  A  is  easily  obtained  directly  from  the  class  num- 
bers and  class  totals  in  the  one-way  classification.    However,  this  same  sum 
of  squares  may  be  obtained  in  the  more  general  way  as  shown  below: 

S.Sqs.  -  B«  B 

where       is  a  row  vector  of  the  constant  estimates  for  a  given  set  (such  as 
the  aj_)5  Z-1  is  the  inverse  of  the  segment  of  the  inverse  of  the  variance- 
covariance  matrix,  corresponding,  by  row  and  column,  to  this  set  of  constants; 
and  B  is  a  column  vector  of  the  set  of  constants.    The  sum  of  squares  ob- 
tained in  this  manner  is  equal  to  the  reduction  in  sum  of  squares  due  to 
fitting  all  constants  minus  the  reduction  in  sum  of  squares  due  to  fitting  all 
constants  except  the  set  being  considered.    This  sum  of  squares  may  be  obtained 
in  this  manner  regardless  of  i^hich  of  the  two  restrictions  have  been  imposed 
on  the  constants,    Obvioasly,  the  addition  of  the  extra  row  and  column  is  un- 
necessary if  a-p  has  been  assumed  to  equal  zero.    The  last  row  and  column  is 
also  unnecessary  if  the  restriction  was  imposed  that  Z  aj_  =  0. 
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Numerical  Example 

Suppose  the  data  given  in  the  table  below  represent  gains  of  individual 
barrows  that  were  randomly  assigned  to  three  different  rations. 


na%±on  No^ 


Pig  No. 

1 

2 

3 

1 

3 

5 

7 

2 

5 

6 

6 

3 

6 

2 

U 

I 

2 

7 

3 

5 

8 

6 

6 

3 

h 

7 

8 

9 

8 

Yi 

16 

h8 

30 

^i 

h 

6 

Completing  the  analysis  of  these  data  by  least-»squares  procedures  requires 
the  following  steps? 

1)  Mathematic  models 
where : 

71  j  ■        g^i'^-       ^®        barrow  on  the  i.^^  ration, 

\x  »  overall  mean  with  equal  numbers. 

=•  effect  of  the  i^^  ration. 

e^j  »  random  error  assumed  to  be  independent  and  normally 
distributed,  i.e.,  NID(G,ae)» 

2)  Least-squares  equations: 


h 

b 

RHM 

[It 

18 

h 

8 

6 

9U 

^1* 

h 

h 

0 

0 

16 

^2* 

8 

0 

8 

0 

18 

6 

0 

0 

6 

30 

3)    Imposing  Uie  restriction  that  Z  r^^  »  Os 


„  8  - 


The  reduced  least-sgiiares  equations  are  as  follows: 


16    -2  2 

»2  10  6 
2  6 


RHM 


9h 

IB 


h)   Inversion  of  the  variance-sovariance  Tr3.tT±T  and  conroutation  of  the  con- 
stants : 


(18    -2     2\  /e060l85     .0231li8  -.Ol8^9\ 

-2  10  6  »  I  .OimF  al435l9  -.061815  ......  (1) 
^2     6    IhJ         V.018519    «.05i:ei^  .10l852y 

t  -  (.060l85)(9b)  ^  (.0231h8)(-ll)  -  (.Ol85l9)(l8)  =  5 

ri  =  (.0231h8)(9li)  +  (.1J:;3519)(-1L')  -  (.06h6l5)(l8)  -  -1 
r2  -  (-.018319) (9li)  -  (»06b8l5)(-lli)  ( .101852) (18)  -  1 
^3  "  -(?1     ^2)  *  ^« 

5)    Setting  up  the  analyslB  of  variances 


Error  S.  Sqs 


«  22  yf^j  -  R(|.i,ri) 

«  566  -  (5) (91)  «  (-l)(-lii)  -  (1)(18) 
«  568  ^  502  «  66. 


Ration  S.Sqs. 


11.1112. 


The  analysis  of  variance  is  as  follows: 


Source  of  Variation  d,f. 


S,Sqs. 

11.1112 
66.0000 


M.S. 

li.laOOO 


F 

1.263  n.s. 


Rations  2 
Error  15 


9  - 


It  is  easily  verified  that  the  sum  of  squares  for  rations,  11,1112,  is 
equal  to 

h  8  6     "  18 

and  that  the  error  sum  of  squares  is  equal  to 

2Z  yf .  -  Z  ^  =  568  -  502  =  66 
ij  i  "i. 

Hence,  in  the  one-way  classification  the  between  group  uncorrected  sum  of 

squares,  Z  lii  equals    R(|j,,  a^),  even  though  the  number  of  observations 
i  ^i. 

varies  from  group-to-group.  Also,  in  the  one-way  classification  p.  and  the  t± 
can  be  obtained  directly  from  the  class  or  group  means  as  follows: 

,  a  ^  (h  +  6  +  5)  =  5  in  this  example,  and 

Ti  »  Ti  -  ^^• 
Hence,  in  the  present  example, 

=  h  -  5  «  -1 
«  6  -  5  »  1,  and 
r3  »  5  -  5  -  0  . 

6)    Standard  errors  and  individual  comparisons: 

(a)    The  standard  error  of  an  effect,  such  as  the  ri,  has  little  sig- 
nificance itself  since  it  cannot  exist  without  the  mean.  The 
standard  errors  will  be  computed  for  the  least-squares  means,  \l 
and  the  p.  +  rj_. 

=  V  (. 060185) (h.h)  =  .51 
sjl+jLj^  =  Vj[.060l85  +  .113519  +  (2)(.0231h8)](lj.[0 


\/  (.25)(b.)4) 

1.05 


s^I+fg  '  \/p60l85  +  .101852  +  (2)(-. 018519)]  (I.. b) 

-  y/  (.125)(li.li) 
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sjl+r^  "  \/f  060135  H  ,ll57bl  +  (2)(-.00li629)J  (U.U) 


V  (. 166668) (li.h) 

,86 


The  inverse  elements  for  the  coefficients  in  the  least-squares  equation 
that  were  subtracted  out-  above  are  necessary  to  obtain  the  standard  error 
for  p,  +  r-j.    These  were  obtained  as  follows: 

ai^Thl  -  .Ih35l9  +  .101852  +  (2)(-.061i8l5) 

and  -.001629  »  '.{.0231I18  -  .018519). 

Of  course,  in  the  one-way  classification  it  is  unnecessary  to  go 
through  all  this  manipulation  to  obtain  these  standard  errors.    They  niay 
be  obtained  directly  from  the  well  known  formula. 


where  s'^  is  the  estimate  of  the  error  variance,  a|,  and  n  is  the  number 
in  the  i"''^  group.  In  order  to  show  that  the  two  methods  give  identical 
results,  they  are  computed  directlv  below: 


1.05 


,86 


(b)    Individual  comparisons  among  the  t^* 

The  well  known  "t"  test  can  be  employed  to  coranare  any  two  of  the 
f j_  effects  if  desired,  but  becomes  less  satisfactory  as  the  number 
of  effects  increase.    The  ratio,  distributed  as  t,  is 
r^  -r, 

t  = 


Tests  of  significance  among  all  pairs  of  the  r^^  in  this  example 
using  the  "t"  test  are  given  below: 


(i) 


r2 


t  -  v([.3 10519  +  .101852  -  (2)(-.06h8l5))  ih.h) 
-2 

=  \/(.375)(h.l!) 


-2 

'  17255 

=  -1.56 
(ii)  T]^  -  r3  =  -1 


t  -  \/[.lU35l9  +  .115711  -  (2)(-.07870U)]  {h.h) 
-1 


=  V  (.hl6668)(l.U) 


=  -.7h 

(iii)  ?2  -  ^3  "  ^ 


t  =  v/[.10l852  +  .II57I1I  -  (2) (-.037037)]  [h.h) 

1 


V  (.291667) (h.h) 
1 

1.133 


Duncan's  multinle  range  test,  as  modified  by  Kramer,— ^  may  be 
employed  to  make  all  rtairwise  comoarisons  with  the  use  of  the  in- 
verse elements  and  the  standard  deviation  for  error.    If  the  values 


(y^  -  y^)  ycii  *  CJJ  -  2CiJ 


are  greater  than  Oq  Zp^np  ^'^^  difference  is  significant;  where  Zp^n2 
is  the  significant  studentized  range  value  in  Duncan's  tables  (either 
the  ,05  or  .01  table),  p  is  the  number  of  means  in  the  range  chosen 

ramer,  Clyde  Young.    Extension  of  multiple  range  tests  to  group 
correlated  adjusted  means.    Biometrics  13?  13-18.  1957. 
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and  n2  is  the  number  o-^  degrees  of  freedou!  for  error.    For  com- 
putational  convenience  and  for  this  example   these  tests  are  laade 
in.tabt!lar  form  below  at  the  ,05  level  of  probability? 


CoRparisoR    y±  -  \^ 


1*2  vs  r3  J. 
r2  vs  2 
r3  vs  I 


2.619 


2.62 


6,31 


2.309 


ii.62 


6.63 


2,191 


2ol9 


6.31 


All  differences,  of  course,  are  non-significant  for  this  example* 
These  tests  are  given  merely  to  illustrate  the  computational  pro- 
cedures involved » 

The  sums  of  squares  required  to  perform  tests  of  significance  for 
orthogonal  contrasts  among  a  set  of  constants  are  obtained  by  trans- 
forming the  segment  of  the  inverse  corresponding  to  the  set  of  con- 
stants.   This  transformation  is  equivalent  to  the  computation  of  the 
inverse  elements  corresponding  to  the  variances  and  covariances  of 
the  selected  orthogonal  contrasts.    These  elements  are  most  easily 
computed  by  means  of  a  relatively  simr)le  matrix  multiplication  pro- 
cedure*   If  T  is  the  transformed  segment  of  the  inyerse,  K  the 
transformation  matrix  and       the  square  segment  of  the  variance- 
covariance  inverse  with  the  additional  column  and  row  added  it  can 
be  shown  that  T  »  KZj^K», 

The  transformation  matrix  K  is  obtained  from  the  orthogonal  co- 
efficients which  define  the  desired  set  of  orthogonal  contrasts. 
Suppose  the  following  orthogonal  comparisons  are  desired  among  the 
three  rations  in  the  present  example? 


It  should  be  noted  that  the  sum  of  the  coefficients  by  row  in  the 
transformation  matrix  sum  to  aero  by  row  with  the  sign  considered 
but  to  unity  when  disregarding  the  sign. 

The  transformation  of  the  complete  segment  of  inverse  elements 
for  the  r±  and  the  computation  of  the  su^tis  of  squares  for  each  con- 
trast are  given  below? 


Then  the  transformation  matrix  is: 
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1  [2  -1  ^1 

H  [0     2  «2 


,113519  -.O6L815  -.078701 
..06L815  .101852  -.037037 
..07870h    -.037037  .115711 


KZaK'  = 


1 
H 

1 

16 


.b30557  -.19libh5  -.236112I 
.027778      .277778  -.305556] 

".h30557  -.19lihh5  -.236112I  1 
.027778     .277778    -.305556^  ^ 

"1.291671  .0833311 
.08333IJ  1.166668 


An  easier  method  of  computing  T  is  bv  subtracting  the  co- 
efficients in  the  last  column  of  the  transformation  matrix  K  from  the 
coefficients  in  the  other  columns  prior  to  the  matrix  multiplications. 
In  this  case,  the  last  column  and  row  for  Z  need  not  be  added,  i.e.. 


KZaK  = 


The  constants  for  the  two  orthogonal  contrasts  are  obtained  from 
the  transformation  matrix  and  the  r^^^  constants,  as  follows? 


1 

2    -1  -1 

•-l 

i 

.  0_ 

n 

0  2-2 

Therefore.,  c-. 


'1  -  " 

constant  for  the  contrast 


and  c 


is  the  estimate  of  the 


2  =  i  where  Cj[ 
It  will  be  noted  that  these  constants  are 


one-half  of  the  difference  obtained  for  the  contrast  from  the 
constants  directly  and  the  orthogonal  coefficients. 

The  sums  of  squares  for  the  two  orthogonal  single  degree  of  free- 
dom contrasts  are  now  obtained  from  B'Z-^B  in  the  usual  manner. 

S.Sqs.  for  Ration  1  vs.  Rations  2  and  3 

=  LimUm  ^  6.9677 

1.291671 

S.Sqs „  for  Ration  2  vs.  Ration  3 

=  (1/2)'  (16)  =  3J,286 
1.166668 

The  sums  of  squares  for  this  set  of  orthogonal  contrasts  do  not 
add  up  to  the  sum  of  squares  among  the  rations  shown  in  the  an- 
alysis of  variance  table  on  page    9,  i»c.,  6.9677  +  3.1'286  +  11.1112. 
This  is  true  because  of  unsqual  numbers  among  rations.    In  effect, 
the  partitioning  of  the  two  degrees  of  freedom  among  the  rations  into 
the  chosen  set  of  orthogonal  comparisons  has  created  a  two-way 


classificatiori  situation  with  unequal  subclass  numbers .    If  the  same 
means  are  conrpai^ed  mth  sreighted  coefficients  that  are  directly  pro- 
portional to  the  unequal  frequencies  the  sums  of  squares  for  the 
contrasts  will  add  to  the  sum.  of  squares  for  rations* 

A  short-cut  method  of  coiriputdjng  the  inverse  of  the  variance-covariance 
matrix  and  the  constant  estii-natess 

When  constants  are  fitted  for  all  degrees  of  freedom  among  a  set  of 
classes  or  subclasses  the  inverse  of  the  variance-covariance  matrix  and 
the  constant  estimates  can  often  be  obtained  most  easily  with  the  aid  of 
a  transformation  matrix.    If  D  is  the  diagonal  least-squares  coefficient 
matrix  for  the  classes  or  subclasses  and  K  is  the  transformation  matrix, 
then  the  inverse  of  the  variance-covariance  matrix  (C)  is  computed  from 
KD-^K'.    The  constant  estiireites  are  obtained  from  KS  =  B,  where  S  is  a 
column  vector       the  class  or  subclass  means  and  B  is  a  column  vector  of 
the  constants. 

The  diagonal  least-squares  coefficient  matrix  (D)  for  the  present 
one-way  classification  example  is 


D  » 


h  0  0" 
0     8  0 

0     0  6 


These  are  the  left  hand  members  for  the  least-squares  equations  for  the 
H  +  r^.    The  transformation  natrix  is 


1  1" 
-1  -1 

2  -1 


since  this  is  the  functional  relationship  of  p.  •♦•  rj_,  the  ration  means, 
and  the  two  kinds  of  constants  |i  and  rj[,  i.e.. 


S2 


si  - 
2 


3 


1  *       1  * 
3    ^     3    ^     3  ^ 


±  ST    +  —  So  -  ±  So 

3  3^3^ 


where  the       are  the  subclass  (ration)  means. 

Since  D  is  a  diagonal  matrix,  its  inverse  consists  only  of  re- 
ciprocals of  the  diagonal  elements.    Hence,  the  inverse  of  the  variance- 
covariance  matrix  and  the  constant  estimates  can  be  computed  individually 


bv  setting  up  the  following  table: 


^  l^il""         D-l  Constants 


1 

2 

1 

h 

.250000 

< 

3  ■ 

3 

3 

SO 
c 

1 
3 

1 
"3 

2 
3 

6 

.125000 

-1 

1 

_1 

5 

.166667 

1 

^3 

3 

"5 

5  in 

the 

last 

colnnin  of 

the  above 

table  are 

plying  each  of  the  columns  in  the  transpose  of  the  transformation  matrix  on  the 
left  in  turn  with  the  column  of  ration  means.    For  examrole, 

JI«i(h  +  6  +  5)=5 
ri  -i  [(2)(I,)  .  6  -  5]  =  -1 
^2  -  J  [-^  ^  (2)(6)  -  5]  «  1  . 

Since       +  f 2  +       =  0,  then  r^^  can  also  be  easily  computed. 

The  inverse  elements  of  the  variance-covariance  matrix  are  obtained  by- 
multiplying  each  column  of  the  transpose  of  the  transformation  matrix  by  it- 
self and  with  each  of  the  other  columns  in  turn  and  then  multiplying  this  pro- 
duct by  the  D-1  column.    For  example, 

CP^  =  ^  (,250000  +  .125000  +  .166667) 

=  .060185, 

0^2  =  i  [(2) (.250000)  -  .125000  ,166667] 

=  .0231ii8,  , 

etc. 

The  inverse,  obtained  in  this  manner,  is  presented  below: 

,060185  .0231ii8  -.018519 
>0231ijt8  0IL35I9  -,06ii8l5 
.018519    -.061815  .101852 

It  will  be  noted  that  these  elements  agree  with  the  inverse  given  in  (l). 
The  inverse  computed  in  this  manner  can  be  obtained  as  accurately  as  desired 
since  the  only  source  of  error  is  the  number  of  significant  digits  carried  in 
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the  reciprocals  of  the  diagonal  elements.  The  same  is  true  for  the  constants 
and  the  class  or  subclass  means e 

Summary  of  the  One-waj  Classification  Analysis 

Although  in  practice"  one  would  not  use  the  least-squares  procedures 
when  the  data  are  classified  in  only  one  way,  it  is  evident  that  the  same  re- 
sults can  be  obtained  with  these  procediires  as  with  commonly  used  direct  cal- 
culation procedures.    In  addition,  the  principles  and  computational  pro- 
cedures of  least-squares  analysis  can  be  more  easily  introduced  by  way  of  the 
one-way  classification  analysis,,    Extension  of  these  orocedures  from  the  one- 
way classification  to  more  complex  designs  is  straight-forward  for  one  who  is 
familiar  with'  matrix  aritrimetic.    The  extensions  to  the  more  complex  designs 
are  given  in  later  sections.    Also  some  additional  least-squares  computational 
procedures  are  introduced  in  the  remaining  sections  which  are  not  applicable 
to  the  one-way  classification.    These  are  the  absorption  of  a  set  of  equa- 
tions and  the  comDutation  of  coefficients  for  variance  components  in  the  ex- 
pectation of  adjusted  sums  of  squares  for  effects  that  have  been  absorbed. 
Procedures  for  obtaining  coefficients  for  variance  components  in  least- 
squares  sums  of  squares  or  cross-products  are  given  in  the  section  for  the 
two-way  classification. 


ONE-WAY  aASSIFI CATION  WITH  REGRESSION  OR  OOVARIANCE 

Covariance  analysis  of  data  from  a  one-way  classification  can  be  accom- 
plished by  standard  methods  even  though  unequal  numbers  exist  from  class-to- 
class.    However,  identical  results  can  be  obtained  with  least-squares  pro- 
cedures, which  directly  involve  matrix  arithmetic.    These  procedures  are 
applicable  to  a  wide  range  of  problems.    If  mean  s eparation  procedures  are  to 
be  used  on  the  least-squares  class  means  or  if  a  set  of  orthogonal  comparisons 
is  desired,  it  is  best  to  obtain  the  inverse  of  the  variance-covariance  matrix. 
An  alternative  procedure  for  obtaining  this  inverse,  which  is  often  useful,  is 
presented  in  this  section  for  the  cne-wav  classification,  when  a  continuous!/ 
independent  variable  raus't  also  be  considered.    The  method  is  applicable  also 
to  more  complex  nroblems. 


Model 

The  mathematical  model  for  the  one-way  classification  with  a  regression 
may  be  written  in  two  ways.    If  the  continuous  independent  variable  (X)  is 
to  be  taken  as  a  deviation  from  the  mean  (of  X),  then  the  model  is  as  follows: 


3/  Actually,  it  is  not  necessar^f  for  the  covariate  to  be  a  continuous  var- 
iable.   However,  except  for  the  case  of  only  two  classes  such  as  sex,  it 
is  usually  more  satisfactory  to  fit  individual  constants  for  discrete 
classifications  rather  than  regressions.    Hence,  the  covariate  will  always 
be  referred  to  here  as  a  continuous  variate  rather  than  discrete. 
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Tij  =     +  ai  +  bdij  -  x)  +  eij 


i  =  1,  2,  -  -  p 

j  «  1,  2,  ,  rii 

where:  y^j  =  the  j"^^  observation  in  the  i"th  a  class, 

H  =  the  overall  mean  for  the  y^j  when  equal  frequencies 
exist  in  each  of  the  A  classes, 
a^  =  the  effect  of  the  i'th  A  class, 

b  =  partial  regression  of  the  dependent  variable  (y)  on 
the  independent  continuous  variable  (X)  holding  the 
discrete  variable  (the  a^)  constant.    The  discrete 
variable  or  variables  are  held  constant  in  a  co- 
variance  analvsis  by  estimating  b  on  an  error  line 
basis.    In  the  one-way  classification  this  means  that 
b  is  estimated  on  an  intra-groun  basis. 
Xj^j  =  the  continuous  independent  variate  for  the  corres- 
ponding j^^  observaition.    The  Xj^j  are  regarded  as 
fixed  and  measured  without  error, 
X  ■  the  arithmetic  mean  of  the  X^j, 

the  random  errors.    These  are  assumed  to  be  in- 
dependent and  if  tests  of  significance  are  to  be 
made  they  also  must  be  assumed  to  be  nori.ially 
distributed. 


In  practice,  it  is  more  convenient  to  work  with  the  values  of  rather 
than  the  plus  and  minus  deviations  of  the  X±a  from  the  mean,  5c.  When  this  is 
done  the  descriptive  working  model  is  as  follows: 

y^j  =  a  +  ai  +  bX^j  *  » 

where  the  definition  of  all  common  terms  remains  the  same.    The  new  symbol,  a, 
is  the  population  mean  when  X  is  equal  to  zero.    Using  this  model  it  is 
therefore  necessary  to  obtain  the  estimate  of  \i,  from 

G,  =  a  +  b  X 


Least-Squares  Equations 

The  least-squares  equations  for  the  one-way  classification  with  one  con- 
tinuous independent  variable  are  shown  below  in  tabular  form: 
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a  %  b  RHM 


ai 

n. 

X. 

Y. 

H'' 

Xi 

b: 

2ZX\. 

If  the  Xj_j  are  expressed  as  deviations  from  x  then  the  equation  for  b 
becomes 

ZZ(Xij  -  x)  %  +  2Z(Xij  «  x)^^  =  ?^(Xij  -  x)yj,^ 
ij  ij  ij 

and  ji  is  estiniated  directly  rather  than  a. 

Imposing  Restrictions 

The  restrictions  imnosed  In  order  to  obtain  a  unique  solution  to  these 
equations  are  the  same  as  for  the  one-way  classification  without  the  re- 
gression being  included.    Thus,  no  restriction  is  necessary  for  the  repression 
equation.    In  the  subtraction  procedure  when  the  restriction  of  2       =  0  is 

imposed  the  column  and  row  coefficients  for  the  b  equation  are  treated  in  the 
same  manner  as  the  RHM  values. 


Inversion  of  the  Reduced  Ka.trix  and  Solution  of  the  Equations 

When  constants  for  all  the  degrees  of  freedom  among  a  set  of  classes  or 
subclasses  are  being  fitted,  in  addition  to  the  regression  (or  regressions), 
an  alternative  procedure  is  available  for  obtaining  the  inverse  of  the  var- 
iance-covariance  matrix.    In  this  case  the  regression(s)  are  being  fitted  on 
a  "within"  class  or  subclass  basis  and  can  be  calculated  from  the  equation(s) 
obtained  on  a  "within"  basis.    For  example,  in  the  simple  case  being  con- 
sidered here  the  least-squares  estimate  of  b  is  given  by 


XiYi 


ij    ^3  i 

The  diagonal  inverse  eleirent  of  the  variance-covariance  matrix  corresoonding 
to  the  b  is 

1 

-.  -— — — X?  • 

ZZ  X?.  -  Z  -1 

ij  i  "i 

The  problem  is  to  adjust  the  inverse  elements  of  the  diagonal  coefficient 
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matrix  of  the  a  +  ai  and  then  transform  this  adjusted  inverse  matrix  with  the 
approDriate  transformation  matrix.    In  order  to  obtain  the  inverse  elements  of 
the  a  +  aj_  section  it  is  first  necessarv  to  obtain  the  inverse  elements  of  the 
variance-covariance  miatrix  for  the  b-column  or-row.    These  elements  are  re- 
ferred to  as  the  G  section  of  the  variance-covariance  inverse  and  are  obtained 
as  follows! 

Qib  =  Qbi  =  __  i 


^il 


Z2  X 


The  diagonal  inverse  elements  for  the  a  +  aj_  section^ 
from 


A,  are  then  computed 


A^i  =  ~(1  -  X^Gbi) 


and  the  off -diagonal  inverse  elements  for  the  a  +  aj_  section  are  given  hy 


Aij 


"i 


or 


Aij  -  -  l-(X.G^i) 
n^  J 


The  inverse  of  the  variance-covariance  matrix  for  the  a  and  aj^  separately 
max  then  be  obtained  from  the  matrix  multiplication  KAK',  where  K  is  the 
fiinctional  relationship  matrix  between  the  classes  (the  a  +  a^)  and  the  in- 
dividual ccnstants  (a  and  aj_),  and  A  is  the  inverse  matrix  for  the  a  +  a^.  In 
the  one-way  classification,  as  was  shown  in  the  numerical  example  in  the  pre- 
vious section  on  the  one-wav  classification. 


I 

P-1 

P 

1 

p 


1  1 

-i  -1 

P  P 
p-1  .1 

P  P 


P-1  _1 
"0  P 


The  estimates  of  the  constants,  a  and  the  . 
either  of  two  wa^^s.  The  inverse  matrix  for  the 
plied  by  the  appropriate  right  hand  members,  i.e.  the  Y- 


may  then  be  obtained  in 
and  b  mav  be  multi- 


+  a 


ro%  (or  column)  at  a  tim.e,  to  obtain  the  a  +  a^. 
striction  that  Z  ai  =  0,  the  estimate  of  a  is  obtained  from 
i  Z(a  +  a^) 

i 


I  and  ZZ  Xj_jiy^j,  or 
Then  by  imposing  the  re- 
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and  the       are  then  cornputed. 

The  second  method  of  computing  a  and  the  a^  is  from  the  complete  variance- 
covariance  inverse  matrix  for  a,  a^^,  and  b  and  also  the  right  hand  members  for 
the  reduced  least  squares  matrix*    The  im'erse  elements  obtained  above 
(the  G's)  for  the  b-row  and-coluiiin  must  be  transformed  so  that  they  will  cor- 
respond to  the  variance-covariance  iratrix  for  a,  a^^,  and  b  rather  than  to 
a  +  ai,  and  b.    These  are  easilv  obtained  from  KG,  where  G  is  the  column  vec- 
tor of  the  Gib. 


Computation  of  Sums  of  Squares  and  Standard  Errors 
The  sum  of  squares  for  error  is  equal  to 

Z2  y?j  -  R(a,  a^,  b)  . 

The  reduction  due  to  fitting  all  constants  R(.  x,  aj_,  b)  is  equal  to  Sl±,  b), 

and  is  obtained  from 

a  Y.  -p  2  aiYi     ^  |^  ^ij^ij 


or  more  easilv  from  p-1 

h  (Y,  -  Y„)  +  t  1^ 


^  Y.  .  (Yi  -  Yp)  +  t    ZZ  Xijyij 


If  the  Xj_j  have  been  sq^ressed  as  deviations  from  x?  then  [i  is  obtained 
directly  rather  than  a,  and 


R(ji,  a^,  b)  =  R(a,  'ai,  b)  =  jl  Y.  +  2  ^Yi  +  -  "  ^)  7ij 

p-1 

=  (I  Y,  +    2    a  (Yj  -  Yp) 
i=l  ^ 

*  b  22  (Xij  .  x)yij 

In  the  one-way  classification  with  covariance  the  error  sum  of  squares 
can  be  obtained  asore  directly,  of  course,  from 

22  yf .  -  2       -  b^(22  Xf.  -  2  fl)  , 
ij  ±H  ij  i 

since  R(|i,  a^^,  b)  is  equal  to  the  last  two  terms  in  this  formula. 

The  sum  of  squares  among  the  A  classes,  adjusted  for  variations  in  the 
mean  of  X,  may  be  obtained  from  B'Z-^B  as  explained  in  the  previous  section. 
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la  the  one-way  classification  with  covariance  this  sum  of  squares  is  obtained 
by  direct  covariance  procedures*    The  more  general  methods  are  presented  here 
so  that  extension  to  analyses  where  standard  methods  of  analysis  are  un- 
available can  be  made  more  easily. 

The  sum  of  squares  for  testing  the  significance  of  the  regression  co- 
efficient (from  zero)  is  obtained  in  the  same  manner,  i.e.,  B'Z""1b,  but  in 
this  case  the  seemingly  complicated  matrix  multiplication  reduces  to 

b^  (.y.T.  Xf .  -  Z  . 
ij  i  ni  ' 

Standard  errors  for  the  least-squares  means  for  the  A  classes,  jS  +  fi^, 
may  be  obtained  from  the  variance-covariance  inverse  matrix  for  (X,  ai,  and  b 
as  follows: 

where  ffg  is  the  mean  square  for  error  and  the  superscripts  on  the  C's 
identify  the  inverse  elements . 

The  inverse  elements  th&t  would  have  been  obtained  for  the  ;ji-row 
(or-column)  if  the  X's  had  been  taken  as  deviations  from  3S  can  be  computed, 
if  desired,  from 

If  these  are  available,  the  standard  errors  of  the  jl  +  t^^  can  easily  be  com- 
puted from 


Computation  of  standard  errors  of  differences  among  the  least-squares 
means  Beed  not  involve  the  inverse  elements  in  the  p  or  a  row  (or  column) 
since  „  „    

iU^rai)  =.  (fl+aj)       ai=aj     ^  /  e 

These  standard  errors  of  differences  among  the       are  required  if  mean  sep- 
eration  procedures  are  to  be  employed.    The  inverse  elements  for  the  tj^ 
section  can  be  transformed,  as  explained  in  the  previous  section,  if 
orthogonal  comparisons  among  the       are  desired. 

Humerij^l  Example 

The  same  set  of  data  used  in  the  previous  section  is  repeated  below  but 
with  the  addition  of  a  covariate  as  follows: 
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Ration  No, 


2  3 


Pig  No. 

W 

y 

W 

y 

w 

y 

1 

5 

3 

ii 

5 

8 

7 

2 

9 

5 

7 

6 

7 

6 

3 

11 

0 

u 

c. 

3 

u 

h 

3 

8 

7 

2 

3 

10 

8 

8 

6 

O 

o 
d 

o 

c. 

I, 

7 

12 

9 

8 

5 

8 

Totals 

28 

16 

hS 

18 

30 

30 

Means 

6 

6 

5 

1)  Ms-thematical  model? 

»  a  +       +  b  Wj^j  + 

1  «  1,  2,  3 

j  =  1.  2,  8 

where : 

yj^j  =  the  gain  of  the  j"^^  barrow  on  the  i'th  ration, 

a  »  the  overall  mean  when  Wj^j  =  0, 
r^  =  the  effect  of  the  ration, 

b  »  partial  regression  of  gain  on  initial  weight, 

Wj^j  "  the  initial  weight  for  the  j'^^  pig  on  the  ration, 

eij  «  random  errors. 

It  should  be  pointed  out  that  the  assumption  of  random  errors  requires 
thet  the  regression  be  linear  and  homogeneous  from  one  ration  to  another. 

2)  Least-squares  equations: 


a 

?1 

?2 

2^ 

s 

RHM 

a: 

18 

8 

,6 

106 

9h 

\x 

h 

0 

0 

28 

16 

rg: 

8 

0 

8 

0 

li8 

h8 

6 

0 

0 

6 

30 

30 

bs 

106 

28 

1,8 

30 

832 

656 
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3)    Iirposing  the  restriction  that  Z  r^^  »  0: 

The  reduced  least-squares  equations  are  given  below  in  tabular  form: 


a 

T2 

b 

RHM 

a: 

18 

-2 

2 

106 

91 

ri: 

-2 

10 

6 

-2 

-lU 

2 

6 

Ih 

18 

18 

b: 

106 

-2 

18 

832 

656 

it)    Inverse  of  the  reduced  least-squares  equations  and  constant  estimates: 

The  inverse  of  the  reduced  set  of  least-squares  equations  obtained  with 
an  ordinary  method  of  inversion  is  given  below: 


Matrix  Inverse 


a 

ri 

f2 

a: 

.212003 

.053li5l 

-.018519 

-.030303 

.053h5l 

olii8569 

-.061815 

-.005051 

rg: 

-.018519 

-.06h8l5 

.101852 

.000000 

b: 

-.030303 

-.00505^.. 

.000000 

.005051 

(2) 


The  constant  estimates  may  be  obtained  by  multiplying  each  column 
(or  row)  of  the  inverse  by  the  reduced  HHK's.    The  constants  obtained  are: 

a  -    1.7879        T2  -  1.0000 

^  =  -1.535U        %  -  ,S37n 

The  overall  mesm  p,  is  then  obtained  from 

»  a  +  S  w 

=  1.7879  +  (.535U) (5.8889) 
=  U.9108 

The  estimate  of  is 

^3  "         +  ^2) 

«  -(-1.535U  +  1.000) 
=  .5351  . 
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Since, 


^  -s-  r^^  «  a  +  S  w  +  , 
the  diagonal  inverse  element  for  p.  *  rj^can  be  computed  from 

Another  method  of  computing  this  inverse  diagonal  element,  which  is 
necessary  if  the  standard  error  of  p.  +  r^^  is  desired,  is  possible  from 

However,  the  inverse  elements  involving  \i  are  not  directly  available  from 
the  inverse  matrix  vjnless  the  Wj^j  have  been  expressed  as  deviations  from 
the  overall  mean.    These  inverse  elements  for  the  {i-row  and-coluran  can 
be  computed  from  the  available  inverse  elements,  as  follows! 

CM*t  •  oaa  +  2w  cab  +  52  c^b 
CUTi  .  cori  ^  -  c^ri  , 

In  the  present  problem, 

C*^-  -  .212003  +  (2)(5,88888o)  (-.030303)  ^  (5.888889)^  {.00^^) 

-  .060265  , 

C?*^  -  .053h5l  ^  (5.888889) (-.005051) 

-  .023706  , 

CF*2  «.  -,018519  +  (5.888889)  (.000000) 

-  -.018519  . 

(a)    Alternative  procedure  for  computing  the  inverse  matrix.    As  indicated 
above,  a  method  of  obtaining  the  inverse  of  the  variance-covariance 
matrix  more  easily  in  some  cases  is  available  when  constants  for  all 
degrees  of  freedom  among  the  classes  or  subclasses  are  being  fitted. 
This  method  was  described  above  for  the  one-way  classification  when 
a  regression  coefficient  is  being  fitted  and  will  now  be  illustrated 
with  the  computational  example. 

The  inverse  element  for  the  b,  S  position  of  the  inverse  matrix 

is 

ebb  .  1  

ij  i  n^ 

1 


=  .00505051 

which  checks  with  the  value  for  this  element  obtained  in  the  sinral- 
taneous  inversion  of  the  comolete  matrix.  The  inverse  elements  for 
the  b-column  and-row  for  the  a  +  r^  are  obtained  as  follows  J 


^  L^j        i  ^i 

-  -(,00505051)  (25) 

-  -.03535357  , 

=         ^  .(.00505051)  i^) 

=  -.03030306, 

G3b  =  G^3  „  (.00505051)  (^) 
»  -,02525255  . 

The  adjusted  diagonal  inverse  elements  for  a  +  r^^  are  then  computed 
as  follows: 

All  .  i.  (1  .  w^G^l) 
ni  1 

.1  [1  -  (28) (-.03535357)] 

=  .Ii97U7l99  , 
a22  »  I  [1  -  (LiS) (-.03030306)] 

=  .30681836  , 
a33  »  I  [1  -  (30) (-.02525255)) 

»  .292929I12. 

The  adjusted  off -diagonal  inverse  elements  for  the  a  + 
(which  were  zero)  are  thei  computed  as  follows? 

a12  -  a21  .  .  1.  Wt  G^2 
ni 

»  -  1  (28) (-.03030306) 

h 

.21212II12  , 
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a13  =  a31=™^  (28)(-, 02525255) 

=  ,17676785  , 
a23  =  a32»«1  (L8)(-..02525255) 

0 

-  .15151530. 


Transformation  of  the  a  +  r segment  of  the  inverse  matrix  is  now 
accomfplished  as  follows: 


KAK'  »  I 


1  1 

^1 

2  -1 

-1 

-1  2 

»1 

Ii9717h99  .21212112  .17676785 
,21212112  .30681836  .15151530 
,17676785    .15151530  .202929li2 


K' 


.88636ii26     .670l!55o8  .62121257 
.60606071    -,03li09062  -.09090902 
-.25000000      .25000000  -.16666667 


J_  1  -1  2 
3    1  -1  -1 


.211200355 
.05315121 
-.01851852 


.05315121  -.01851852 
.11856903  -,06Ii8ll8l 
..06L.811i81  .10135185 


These  inverse  elements  check,  within  rounding  errors,  with  those  obtained 
directly  (see  matrix  inverse  (2))  from  matrix  inversion  procedures. 
Actually,  less  rounding  errors  will  accumulate  in  the  computation  of  the 
inverse  elements  with  this  indirect  method  than  when  the  inversion  is 
completed  on  the  least-squares  equations  by  an  elimination  method. 

The  transformed  inverse  elements  for  the  b-cclumn  (and  row)  are  ob- 
tained from 


KG  = 


-.03535357 

-.03030306 

-.02525255 

1  1 

2  -1 
■i    2  -1 


«.03030306 
-.00505051 
.00000000 


which  also  check  with  the  values  for  these  elements  that  were  obtained 
directly  (see  matrix  inverse  (2)). 

(5)    The  analysis  of  variance: 

(a)    Sum  of  squares  for  error. 


Error  S.  Sqs.  =  T.T  yf^  -  R(a,r^,b) 


568  -(1.7879)(9li)-(-1.535h)(-lli) 
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-(l.OOOO)(l8)-(.535Ii)(656) 
=  568  -  558.7806  »  9.219h. 
(b)    Sum  of  squares  for  rations  (R), 
Ration  S.Sqs.  =  B^Z"'^  B 


=  fl  515),   1  oooni  r '1^8569   -.06h8l5l  tl.535h] 
[-1.535h    1.000C|  ^^^-^^^     .101852]  [l.OOOoj 

-  Ti  1  nmr^    TiiiT^     5.929L3l]  Pl.535h| 

-  [-1.535h   1.0000[    j^^T^^  I3.59ll>h7j  [l.ooooj 

=  [-8.376910    L.1.87399]  fi:^^^^]] 
»  17.3U93 

(c)    Sum  of  squares  due  to  regression. 

Regression  S.Sqs.  =  B'Z"^  B 

=  ii535L2l  =  (.5351)"  (198) 
.005051 

=  56.7573 

The  analysis  of  variance  is  as  follows: 

Source  of  Variation    d.f.      S.Sqs.         M.S.  F 

Rations  2     17.3h93     8.67L6  13.17** 

Regression  1     56.7573    56.7573  86.19^ 

Error  Ih       9.219li  .6585 

Differences  dre  to  rations  are  now  highly  significant!  whereas 
in  the  first  section,  using  the  same  data  but  ignoring  the 
initial  weight,  ration  differences  were  not  significant.  The 
error  term  has  been  reduced  from  Ii.IiOOO  to  0.6585. 

Standard  errors  and  individual  comparisons : 

(a)    Standard  errors  of  the  least  squares  means,  \l  and  jl  +  rj^. 


sfk  -  V  (.060265) (.658 5) 
=  .199 


sjl+ri  =  y/  [.060265  +  .llj8569  +  (2) ( .023706]]  (.6585) 
-  .lill 
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s(l+r2  =  \/[.060265  +  .101852  ^  (2)  (-.Ol85l9)J  { .6585) 
=  .287  , 

sil+r3  =  •yfo60265  +  .120791  +  (2)  (»,  005lP?]  (.6585)" 
=  .335 

(b)    Individual  comparisons  among  the 
(i) 


"t"  test 

rT_-.r2  = 

-2. 5351 

t  = 

^1-^3  ' 

-2.0708 

t  = 

_  2.0708  ^  r>  RAW- 

h'h  = 

.ii6l!6 

t  = 

(ii)    Mean  separation  with  Duncan's  I^ltiple  Range  Test  (,05) 


Comparison 

Product 
Differences 

c^e  ^t>in2 

r2  vs 

,L6h6 

2.596 

1.21 

2.)a6 

^2 

2,535b 

2.294 

5.82* 

2.58 

h  h 

2.0708 

2.140 

4.43* 

2.b6 

(iii)    Orthogonal  contrasts. 

Suppose  that  it  is  desired  to  test  the  significance  of 
linear  and  quadratic  effects  among  the  rj_.    In  this  case  the 
transformation  matrix  for  the  r^  segment  of  the  inverse  matrix 
(including  the  third  row  and  column)  is. 


and 


0  -21 
[l  -2.  ij 


KZK» 


1  [2  0-2 
"  "h  [l  -2  1 


.11x8569  -.061.815  -.OP375U 
..06Li8l5  .101852  -.037037 
..08375b    -.037037  .120791 


1  r.b6b6b6  -.055556  -.b0909o1  ip  l" 
b  Ll9bbb5   -.305556     .llllllj    bi  ^  'i 
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1_ 
16 


1.7h7b72  .166668 


,166668  .916668 


r.  109217  .010117' 
Lo10!j17    « 057292 


S.Sqs.  Linear 


S.Sqs.  Quadratic 


^j^l(2)(-l.535U  ™i(2)(.535h| 


(l.0351i)^ 
.109217 


.109217 
=  9.816-x^  . 


I: 


1.535h)  -i(2) (1.0000) 


.057292 

(-.7500)^  «.  9.8l8^«f  . 
.057292 


Again,  it  will  be  noted  that  the  total  of  the  sums  of  squares  for  the 
orthogonal  contrasts  is  not  equal  to  the  sum  of  squares  for  rations* 


Summary  of  the  One-way  Classification  Analysis  with  Co variance 

The  least-squares  procedures  (method  of  fitting  constants)  for  analyzing 
data  with  unequal  numbers  have  been  applied  to  the  analysis  of  data  classi- 
fied in  only  one  way  but  also  considering  a  continuous  covariate.  Detailed 
generalized  procedures  have  been  presented  for  the  computation  of  the 
variance-covariance  inverse  matrix,  the  constant  estimates,  sums  of  squares 
for  the  analysis  of  variance,  standard  errors  of  least-squares  means  and  tests 
of  significance  for  individual  comparisons  or  for  orthogonal  contrasts  among 
the  class  effects.    These  procedures  are  first  presented  in  algebraic  form 
for  this  design  and  then  numerically  using  a  simple  exair.ple.    The  basic  pro- 
cedures described  here  are  directly  applicable  to  more  complex  designs  which 
will  be  discussed  in  later  sections* 


Ti/ZO-WAY  aASSin CATION  WITHOUT  INTERACTION 

When  disproportionate  numbers  exist  among  subclasses  in  the  two-way 
classification  the  method  of  fitting  constants  by  least-squares  should 
usually  be  employed.    For  the  special  case  of  onl.y  two  classes  of  A  or  two 
classes  of  B,  short-cut  rrethods  for  obtaining  sums  of  squares  for  the  analysis 
of  variance  have  been  developed  and  are  given  in  several  statistical  books. 
The  least-squares  method  of  fitting  constants  for  the  general  case  of  the  two- 
way  classification  without  interaction  will  be  presented  here. 
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The  model  for  the  two-way  classification  when  the  interaction  of  A  and  B 
is  assumed  non-»existent  is 

yijk  -     +  ai  -»■       +  eijk   .  (3) 

i  =  1,  2,  p 

J  =  1,  2,  q 

=  the  k^^  observation  in  the  j'th  B  class  and  the  i"*^^  A  class, 
»  overall  mean  when  equal  subclass  numbers  existj 
=  effect  of  the  i"th  A  class, 
=  effect  of  the  j^h  B  class, 
»  random  errors,  assumed  to  be  NID(o,o-|')» 

This  model  reduces  to  that  for  the  randomized  block  design  only  when  k=l  for 
each  combination  of  i  and  j« 

The  aj_  and  the  bj  may  either  be  regarded  as  fixed  or  random  effects.  If 

both  classes  of  effects  are  fixed,  the  model  is  referred  to  as  the  "fixed" 
model,  the  linear  hyoothesis  model,  or  Model  I  of  Eisenhart,    If  both  the 
and  the  bj  are  randomly  drawn  from  some  infinite  population,  such  as  cows, 
dams,  sires,  etc.,  the  model  is  referred  to  as  the  "random"  model  or  Model  II 
of  Eisenhart,    If  the  effects  for  one  of  the  classes  are  fixed  and  the  other 
random,  the  model  is  referred  to  as  the  •'mixed"  model. 

If  a  class  of  effects  is  to  be  regarded  as  fixed  (such  as  the  effects  of 
a  set  of  selected  treatments),  then  the  investigator  is  interested  in  least- 
squares  means,  standard  errors,  tests  of  significance,  mean  separation  pro- 
cedures and  possibly  orthogonal  comparisons.    On  the  other  hand,  if  a  class  of 
effects  is  regarded  as  random,  the  investigator  is  primarilv  interested  in 
estimating  the  variance  component  from  that  source.    Hence,  with  random, 
effects  it  is  desirable  to  have  a  large  number  of  degrees  of  freedom  among  the 
effects  in  order  to  estimate  the  variance  (or  pjssiblv  covariance)  comoonent 
more  accurately.    In  both  cases,  however,  it  is  desirable  to  obtain  the  leas"D- 
squares  sums  of  squares.    With  fixed  effects,  it  is  essential  to  obtain  the 
inverse  of  the  variance-covariance  matrix  in  order  to  make  individual  com- 
parisons.   In  many  cases,  it  is  also  desirable  to  compute  the  inverse  seg- 
ment (s)  of  the  variance-covariance  matrix  for  the  random  effects.    The  general 
least-squares  nrocedure  for  both  kinds  of  effects  will  therefore  be  presented 
and  followed  by  an  absorption  procedure  v*iich  reduces  the  number  of  com- 
putations when  the  random,  effects  are  associated  with  a  large  number  of 


where  s 

•^ijk 
®ijk 
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degrees  of  freedom. 


Least-Squares  Equations 

The  least-squares  equations  for  the  two-wav  classification  are  given  in 
tabular  form  below: 


^i 

h 

RHM 

\xi    n.  • 

J^i. 

n.. 

Y,. 

Yi. 

by.  n.j 

Y.^ 

It  should  be  noted  that  the  sum  of  the  coefficients  for  the  a^  in  the  |ji  equa- 
tion equals  the  sum.  of  the  coefficients  for  the  bj  and  the  coefficient  for  jx. 
In  addition,  the  sum  of  the  coefficients  for  the  b^  in  an  a^  equation  equals 
the  coefficient  for  the  a^  and  the  total  of  the  RM's  for  the  ai  equations  and 
the  bj  equations  equals  the  grand  total  of  the  y±^^  Y,,*    In  order  to  solve 
these  equations  or  to  invert  the  coefficient  matrix  it  is  necessary  to  impose 
restrictions  on  the  a-^  and  one  on  the  b j  • 

Imposing  Restrictions 

A  comm.on  restriction  on  these  equations  is  to  set  ap  -  bq  =  0  and  delete 
the  equations  and  columjis  for  ap  and  bq.    The  inverse  of  the  resulting  re- 
duced coefficient-m3.trix  must  be  transformed  if  the  standard  errors  of  the 
(1  +  a^  or  the  ji  +  S-j  are  desired,  or  if  the  coefficients  of  variance  coi?r^ 

ponents  in  the  expectation  of  m.ean  squares  are  to  be  obtained  by  mec^ns  of  a 
short-cut  procedure  which  will  be  described.    Hence,  it  is  generall;^'-  pre- 
ferred to  imoose  the  restrictions  that  Za^;  =  zB^=0  and  to  make  the 

i  J    ^  ! 

necessary  subtractions  in  the  least-squares  equations  before  inversion  of  the 
ina.trix.    It  is  much  easier  to  complete  these  necessary  subtractions  jbefore 
inversion  than  to  transform  the  inverse  matrix.    In  addition,  the  constants 
obtained  from  the  direct  solution  of  the  reduced  matrix  will  be  in  ihe  form 
desired,  in  most  cases,  i.e.,  \l  is  estimated  directly  rather  than  a^  (i  +  ap 

+  6q,  the       are  estimated  directly  rather  than  as  a^^  -  alp,  and  the  Sj  are  es- 

tiFiated  directlv  rather  than  as       -  S^. 

When  the  restrictions  are  imposed  that  Zli  =  ZS^==0  the  coefficients 

of  one  equation  in  the  ai,  say  ap,  and  one  equation  in  the  bj,  say  bq,  must 
be  subtracted  from  other  coefficients  by  columns  and  rows.  The  subtraction 
of  the  ap  coefficients  is  done  onlv  within  the  a^  columns  of  coefficients  and 
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the  subtraction  of  the  Sq  coefficients  is  done  only  in  the       ccluirms.  j^fter 

corrmletion  of  this  subtraction  by  columns,  the  coefficients  and  RHM  in  the  re- 
sulting ap  equation  are  subtracted  from  the  corresponding  coefficients  and  the 
RHM's  in  zhe  other  aj_  equations.    A  similar  procedure  is  followed  for  the  bq 
equation  coefficients.    After  these  subtractions  are  completed  there  will  re- 
main 1  +  p-  l  +  q-  l  or'P  +  q  -  1  svranetrical  equations.    The  nurriber  of  re- 
rnaining  equations  is  also  the  number  of  degrees  of  freedom  among  the  among 
the       and  one  additional  for  jl. 

Inversion  of  the  Reduced  f^triy  and  Solution  of  the  Equations 

A  least-squares  matrix  for  the  two-way  classification  can  usually  be  in- 
verted without  difficulty  using  ordinary  procedures  and  in  single  precision 
arithmetic  (i.e.,  using  7  or  8  significant  digit  arithmetic)  unless  con- 
founding of  the  effects  is  complete  in  one  or  more  cases.    With  experience  it 
becomes  easy  to  recognize  complete  confounding  of  an       and  a  bj  effect  (if 
such  should  exist)  from  the  n^j  section  of  the  original  least-squares  equa- 
tions. 

The  reduced  set  of  least-squares  equations  can  usually  be  solved 
directly  to  obtain  estimates  of  the  constants,  \i,  a^,  and  bj,  at  the  same 
time  the  inversion  is  being  made.    The  sip  and  6q  constants  are  obtained  from 

5p  =  -z  Si 

bq  =  -5  6j 

and  the  inverse  elements  for  the  ap  and  bq  columns  and  rows  may  be  obtained  in 
a  manner  similar  to  that  exDlained  in  the  first  section, 

Com)3uting  Sums  of  squares  for  the  Analysis  of  Variance 
The  total  reduction  in  sum  of  squares  is 

p-1  q-1 
RiM,  ai,  bj)  =  a  Y.,        2    %  -  Y_)+    I         (Y..  -  Y.  ) 

i«l  ^'      j=l    ^  ^ 

and  the  error  sum  of  squares  is  equal  to 

when  interaction  effects  are  non-existent. 

The  least-squares  or  adjusted  sum  of  squares  for  testing  the  sig- 
nificance of  differences  among  the  A  classes  can  most  easily  be  computed  in 
the  two-way  classification  from  R(|ji,  a^,  bj)  -  R(|i,,  bj),  where 
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the  "between"  uncorrected  sum  of  squares  for  B,    Likewise,  the  least-squares 
sum  of  squares  for  testing  the  significance  of  the  differences  among  the  B 
classes  is  obtained  from 


a^,  bj)  -  R(n,  a^),  where 


the  "between"  uncorrected  sum  of  squares  for  A.    These  sum.s  of  squares  are 
also  computed  when  variance  components  are  to  be  estimated.    In  this  case, 
coefficients  for  variance  components  in  the  expectation  of  these  sums  of 
squares  must  be  computed.    Methods  for  making  these  computations  are  con- 
sidered in  a  later  sub-section. 

In  the  two-way  classification  analvsis,  the  least-squares  sum  of  squares 
for  the  interaction  of  A  and  B  can  be  computed  indirectly  even  though  the 
interaction  constants  are  not  fitted.    This  is  true  because  the  sum  of  squares 
for  interaction  is  equal  to  R jji,a;i^,b j,  (ab)j_      -  R(p,,aj^,bj)  and 


the  "between"  uncorrected  sum  of  squares  for  the  AB  subclasses.    The  same  is 
true  in  least-squares  analyses  with  more  complex  models.    That  is,  when  con- 
stants for  all  effects  among  the  subclasses  are  fitted,  except  the  highest 
order  interaction,  the  sum  of  squares  for  this  interaction  can  be  obtained 
indirectly  from  the  difference  between  the  subclass  sum  of  squares  and  the 
total  reduction  due  to  fitting  constants,    A  similar  indirect  procedure  can 
be  used  to  compute  the  interaction  when  partial  regressions  for  continuous 
variates  are  also  fitted  at  the  same  tim.e,  provided  constants  for  all  effects 
among  the  subclasses  have  been  fitted  except  the  highest  order  interaction. 

The  error  sum  of  squares  for  testing  the  significance  of  interaction 
effects  is 


the  "within"  subclass  sum  of  squares. 

When  the  interaction  effects  are  found  to  be  significant,  by  means  of  the 
test  of  significance  in  the  analysis  of  variance,  the  main  effects  and  the 
corresponding  sums  of  squares  are  biased.    This  is  true  because  of  the 
partial  confounding  of  the  main  effects  and  the  interaction  effects  due  to 
unequal  subclass  numbers.    If  interaction  effects  exist  the  main  effects  must 


R^,ai,bj,(ab)j^jj 


=  2Z 


therefore  be  estimated  from  averaging  over  the  subclass  means  rather  than 
from  the  class  means e    \Jhen  interaction  effects  exist  in  the  two-way  classifi- 
cation and  all  subclasses  are  filled j  the  weighted  squcires  of  means  procedure, 
described  in  mtinv  textbooks  on  statistics,  may  be  used  to  compute  the  sums  of 
squares  for  the  main  effects  adjusted  for  the  interaction.    The  weighted 
squares  of  means  analysis  is  equivalent  to  the  complete  least-squares  analysis 
when  constants  have  been  fitted  for  all  m?in  effects  and  interactions.  This 
will  be  shown  to  be  the  case  in  the  next  section  where  consideration  is  given 
to  the  fitting  of  interaction  constants* 

When  all  subclasses  of  A  and  B  are  rot  filled,  the  degrees  of  freedom  for 
the  interaction  sum  of  squares  must  be  obtained  by  difference  rather  than  from 
{p-l)(q-l)»    The  nujnber  of  degrees  of  freedom  for  interaction  in  any  case  is 
equal-  to  the  number  of  AB  subclasses  minus  the  number  of  A  classes  minus  the 
number  of  B  classes  plus  one. 


Standard  Errors,  Orthogonal  Comparisons  and  Mean  Separation  Procedures 

Detail  procedures  for  computing  standard  errors  of  constant  estimates 
and  linear  functions  of  constants  estimates  are  the  same  as  those  described 
in  the  previous  sections  for  the  one-wav  classification.    Likewise,  methods 
for  testing  the  significance  of  pairwise  differences  among  least-squares 
means  and  orthogonal  contrasts  as  used  in  the  one-way  classification  are  also 
applicable  to  the  two-wav  classification »    In  either  case  the  inverse  of  the 
variance-covariance  matrix  is  required. 


Estiiratica  of  Variance  Components 

When  the  aj_  and  the       are  random  samples  from  nooulations  of  these 
effects,  the  estimates  of  the  variance  comroonents  associated  with  these 
effects  are  of  r,rimar\^  interest.    These  variance  component  estimates  are  ob- 
tained by  equating  the  least-squares  sums  of  squares  or  mean  squares  to  their 
corresponding  expectations  and  solving  the  resulting  equations.    In  the  twO" 
way  classification  without  interaction,  the  analysis  of  variance  is  as 
follows : 

Source  of  Variation       d.f«  S.Sqs. 

A  p-1      R(^,ai,bj)  -  R(M.,bj) 

B  q-1      R(M,,aj_,b^)  -  H(vi,ai) 

Error  n..-p-q+l  ZZZ  yl^y.  -  R(|j,,aj_,bj)  a| 

The  coefficients  of  the  variance  components  in  the  expectation  of  the  mean 
squares,  E(MS),  may  be  computed  with  either  of  two  methods.  One  method  (the 
first  to  be  described)  mav  be  regarded  as  the  direct  method  and  the  other  as 
an  indirect  method.  The  direct  method  of  computing  the  coefficients  of  var- 
iance components,  the  k's,  parallels  the  direct  method  of  computing  the  suras 
of  squares  and  the  indirect  method  of  comrouting  these  coefficients  parallels 
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E(MS; 
4  +  kiog 


the  indirect  method  of  computing  the  sums  of  squares. 

It  will  be  recalled  that  any  least-squares  sum  of  squares  mav  be  comr)uted 
directly  from  the  matrix  multiplication  B'Z-^B,  where  B'isa  row  vector  of  a 
set  of  constants,  Z-1  is  the  inverse  of  the  square  symmetrical  segment  of  the 
variance-covariance  inverse  corresponding  to  this  set  of  constants  and  B  is  a 
column  vector  of  the  constants.    This  is  true  for  the  two-vray  classification 
as  well  as  for  any  other  least-squares  analysis.    However,  in  this  case  the 
sums  of  squares  for  A  and  B  can  most  easily  be  comnuted  indirectly,  i.e., 
from  differences  in  reductions  in  sums  of  squares  as  indicated  above* 


Direct  Method  of  Computing  the  k's 

If  the  restrictions  are  imposed  that  Z  ai  =  2  bj  =  0  in  the  original 

equations  and  the  least-squares  sums  of  squares  are  computed  from  B'Z-^B,  the 
variance  component  coefficients  lrj_  and  k2  are  easily  computed  as  follows? 

2      p    i  A        p.l  ij^A.^. 

The  superscripts  on  Z  identifv  the  elements  in  the  matrix  inverse  to  the 
square  symmetrical  segment  from  the  variance-covariance  inverse  matrix  and 
the  subscripts  A  or  B  identifv  the  segments  for  the  A  effects  and  the  B 
effects. 


Extension  of  the  direct  method  of  comouting  the  coefficient  for  the  major 
variance  component  in  each  line  in  the  E(MS)  for  more  complex  designs  is 
straight-forward.    In  general,  the  formula  for  computing  these  coefficients  is 

k  =  i  (ZZ^i  EZzij), 
"1    i  d.f.  ij,  i^j 

where  d.f.  is  the  number  of  degrees  of  freedom  for  that  line  in  the  analysis 
of  variance  and  m  is  the  number  of  classes  or  subclasses.     This  equation, 
however,  is  not  generally  applicable  to  interactions. 

Indirect  Method  of  Computing  the  k's  1^ 

The  indirect  method  is  accomplished  by  first  computing  the  coefficient 
for  the  variance  component  in  the  reduction  in  sum.  of  squares  due  to  fitting 
all  effects  except  the  set  being  considered.    This  coefficient  is  then  sub- 
tracted from  n...  and  the  result  is  divided  bv  the  degrees  of  freedom  for  t)?at 
source  of  variation  to  obtain  the  coefficient  for  the  variance  component.  The 
problem  consists  of  computing  the  coefficient  for  the  variance  component  in 
the  appropriate  reduction  in  sum  of  squares.    This  is  accomplished  by 

h/  Henderson,  C.  R.    Estimation  of  variance  and  covariance  components , 
"Biometrics  9:  226-252.  1953. 
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comDuting  the  sum  of  crossproducts  between  corresponding  elements  of  two 
square  ina trices. 

The  inverse  of  the  variance-covariance  matrix  when  all  effects  except  the 
set  for  which  the  variance  comoonent  coefficient  is  to  be  computed,  are  in- 
cluded in  the  model  is  on§  of  the  matrices  required  for  use  of  this  indirect 
method.    For  example,  in  the  two-way  classification  analysis,  if  the  co- 
efficient for  a|  is  desired,  then  the  inverse  of  the  variance-covariance 
matrix  for  the  the  model  yjj^  =  ji  ■^  bj  +  e-^k       required.    The  "associated 
sums"  matrix  is  the  second  square  matrix  required  for  use  of  this  indirect 
method  and  is  computed  from_  the  matrix  multiplication  N  N',  where  N  is  the 
segment  of  the  original  comolete  set  of  least-squares  equations  which  contains 
the  coefficients  associating  the  effects  under  consideration  with  all  others 
in  the  model.    For  example,  in  the  two-way  classification,  N  is  the  n±^ 
section  of  the  least-squares  equations  to  the  right  of  the  main  diagonal  and 
N'  is  the  n^j  section  to  the  left  of  the  main  diagonal.    If  the  elements  in 
the  inverse  matrix  for  the  least-squares  equations  for  the  reduced  model  are 
Rij  and  the  ele^rents  in  the  product  matrix,  N  N',  are  N^j,  the  coefficient 
for  a  variance  component  in  the  E(^S)  of  the  analysis  of  variance  is  given  by 

n.,  - 

degrees  of  freedom 


The  quantity 

may  also  be  computed  from  the  sum  of  the  diagonals  in  the  matrix  resulting 
from  the  multiolication 

where  R"^  is  the  inverse  of  the  variance-covariance  matrix  for  the  reduced 
model. 


The  indirect  method  of  computing  the  coefficients  for  variance  com- 
ponents is  more  easily  completed  than  the  direct  method  for  the  two-way 
classification  analysis,  as  was  the  case  in  computing  the  sums  of  squares  for 
tests  of  significance.    The  indirect  method  of  computing  the  coefficients  for 
a|  and       reduces  to 


^1  =  qrr 


(n,, 


1 

2      p-1  j  n.^ 

for  the  two-way  classification  without  interaction, 


-3  7- 


Absorption  of  the  \i+a^  Equations 


Although  the  inverse  of  the  variance-covairiance  coefficient  matrix  for 
the  complete  irodel  is  required,  it  can  often  be  obtained  more  easily  by  a 
partitioning  procedure  than  from  direct  inversion  of  the  reduced  set  of  least- 
squares  equations.    The  absorption  of  the  ji+a^  equations  into  the  bj  equations 
is  only  one  step  in  this  partitioning  procedure. 

It  was  shown  in  the  one-way  classification  that  the  equations  for  \L+a± 
are  identical  with  the  equations  for  the  a^.    When  ji  is  combined  with  the  a.^ 
it  is  unnecessary  to  impose  a  restriction  on  the  ai  since  there  are  p  degrees 
of  freedom  associated  with  the  VL+ai  but  only  p-1  degrees  of  freedom  associated 
with  the  ai  alone.    This  being  true,  and  with 

+  ai  =  ^  (Yi,  -  2  nijbj) 

it  is  fairly  easy  to  absorb  the  equations  for  \i  +  &±  into  the  equations  for 
the  by 

If  the  new  coefficients  for  the  bj,  after  absorption  of  the  |i  +  a^,  are 
C(bjbj)  and  C(b-jbji)  and  the  new  right  hand  members  are  S(bj),  the  reduced 
equations  are  as  follows: 


",A  

bl 

^2 

^3 

K 

RHM 

bi: 

C(bibi) 

C(bib2) 

C(bib3) 

. . .  C(bibq) 

S(bi) 

b2: 

C(b2bi) 

C(b2b2) 

C(b2b3) 

...  C(b2bq) 

S(b2) 

• 

C(b3bi) 

C(b3b2) 
• 

C(b3b3) 

,  o  •  €(b3bq) 

S(b3) 

• 

• 

C(bqbi) 

• 

C(bqb2) 

• 

C(bqb3) 

...  C(bqbq) 

S(bq) 

The  new  coefficients  and  RHM's  are  computed  as  follows: 

2 

C(bjbj).n.j.Z^ 

"i.i"ij' 

C(bjbj»)  -  C(bj,bj)  —2    m,       *  and 


S(bj)  =1,^-2 


i  "i. 
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Unless  the  number  of  |.i  +       equations  is  large  and/cr  the  number  of  bj 


equations  is  large  this  absorption  can  be  completed  on  an  electric  desk  cal- 
cu3.ator.  Prograins  for  completing  this  absomtior  are  now  available  for  high 
speed  electronic  computers  which  peririt  a  wide  variation  in  p  and  q. 

No  restriction  needs  -to  be  imposed  on  the  least-squares  equations  until 
after  absorption  has  been  completed.    After  absorption  it  will  be  noted  that 
the  suiTi  of  the  new  coefficients  for  the  b  j  sum  to  zero  bv  rows  and  columns 
and  that  the  sum  of  the  RHM's,  by  column,  sum  to  zero.    This  provides  a  check 
on  the  absorption  process,  but  r^oes  show  that  dependencies  exist  in  the  re- 
duced set  of  equations.    The  dependencies  are  reeved  bv  imposing  the  re- 
striction that  Z  Sj  =  0,  whereby  the  coefficients  for  £q  are  subtracted  from 

the  coefficients  of  the  other  bj  and  the  resulting  coefficients  in  the  6q 
equation  are  then  subtracted  from  the  resulting  coefficients  in  the  other  bj 
equations.    The  S(bjj)  is  also  subtracted  from  the  other  S(bj)  in  each  of  the 
RHM  columns. 

The  inverse  of  the  reduced  set  of  equations  for  the  bj,  after  absorption 
of  the  |j,  +  ai  and  after  imnosing  the  appropriate  restriction,  is  exactly  the 
same  as  the  inverse  elements  for  the  bj,Sj  segm.ent  in  the  inverse  matrix  of 
the  complete  variance-co variance  matrix:.    The  constants  obtained  for  bj  are 
also  identical  to  the  bj  constants  that  would  be  obtained  by  solving  the 
original  set  of  equations.    In  effect,  the  constants  for  the  bj  are  fitted 
on  a  within  A  class  basis  in  either  case. 

The  reductirn  in  sum  of  squares  obtained  bv  multinlving  the  Sj  by  the 
S(bj)  -  S(bq)  is  equal  to  the  least-squares  sum  of  squares  for  B,  i.e.. 


Estimates  of  the  \i  +  ai,  the  least-squares  mecns  for  the  A  classes,  are 
obtained  by  sirrrolv  solving  the  equations  for  |j,  +  ai  after  computing  the  Sj 
constants.    For  exairole. 


=  Z  bj  S(bj) 


=  R(|i,ai,bj)  -  R(n,aj_), 


and 
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The  remaining  inverse  elements  for  the  complete  variance"Covaria?ice 
inverse  may  be  obtained  if  desired  from  a  relatively  simple  but  general  matrix 
multiplication  procedure.    If  C  is  the  inverse  matrix  of  the  set  of  equations 
remaining  after  absorption  and  after  appropriate  restrictions  have  been  tm- 


is  the  inverse  of  the  diagonal  matrix  for  the  classes  i\x  +  a^)  or 


posed^  D 

subclasses  absorbed  and  N^^  is  the  modified  matrix  of  off-diagonal  coefficients 

in  the  original  least-squares  equations  which  associate  the  effects  being  ab» 
sorbed  with  the  remaining  effects,  then  the  inverse  elements  for  the  n. .° 


section,  (G),  are  obtained  from  -D 
out  interaction,  this  is 


ij 


Nj^C.    In  the  two  way  classification  with- 


p. 


%1  %2 


l(q-l) 


2(q-l) 


P(q-l) 


,11 


.21 


,12 


.22 


.Kq-l) 


,2(q-l) 


.(q-l)(q»l) 


where  the  n|^  are  obtained  from  the  subtraction  of  the  last  coliimn  of  the  n^j 
from  each  of  the  other  n^^  colxjmns,  ioeo, 

n^.  -  «ij  -  « 

If  b^  was  set  equal  to  zero  to  obtain  C,  then  the  last  column  of  the  n^^ 

can  be  deleted  rather  than  subtracted  out  as  indicated  abovee    However,  in  this 
case  the  inverse  elements  obtained  do  not  apply  directly  to  the  constants  as 
shown  in  the  model  (3),    Also,  the  constants  which  are  thus  obtained  directly 
are  linear  functions  of  \i,  a^,  and  b^  rather  than  the  constants  themselves. 

The  immerse  elements  for  the  n  +  a.  square  segment  (A)  are  then  obtained 


from 


.-1 


A  -  D  "  (I  -  N^G^) 


where  I  Is  the  Identity  or  unit  matrix  and  G'  is  the  transpose  of  Ge 

Since  D  is  a  diagonal  matrix  in  the  absorption  of  a  set  of  equations,  the 
matrix  multiplication  procedures  required  to  complete  the  inverse  can  be  done 
rapidly  -  -  even  on  a  desk  calculator  in  many  cases.    In  addition,  the  use  of 
this  method  to  obtain  the  complete  varisnce-covariance  inverse  often  results  in 
less  acc\Bnulation  of  rounding  errors.    It  is  also  possible  with  this  pro- 
cedure to  selectively  compute  the  inverse  for  the  ^  +  a^  when  only  certain  of 

these  means  are  to  be  compared.    If  desired,  the  diagonal  elements  for  the 
H  +  a^  can  be  computed  without  computing  the  off-diagonal  delments.    This  can 
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often  be  accomDlished  without  much  effort  even  though  there  are  several 
hundred  |i  +  aj_  being  estimated. 


l\Tien  the  a.±  effects  are  random  it  is  often  d  esirable  to  obtain  the 
maximum  likelihood  estiiiste  for  the  A  classes,    Henderson  (l9l'9)  ^  has  shown 
that  the  maximum  likelihood  estimates  can  be  obtained  bv  regressing  the  least- 
squares  meaVsS,    The  amount  bv  which  the  least-squares  mean  must  be  regressed 
is  a  function  of  the  corresDcnding  diagonal  inverse  element,  i..e., 


(|i  +  ai  -  |1) 


i  il  +  (ai)  , 

vjhere  |1  +  a^  is  the  regressed  least-squares  mean,  i.e.,  the  maximum  likelihood 
estimate,  and  the  Aii  are  the  diagonal  inverse  elements  for  the  (i  +  a^^ 
Henderson  shows  that  the  m,aximum  likelihood  estimate  reduces  to 


l  +  (n-l): 


(ai) 


for  the  one-wav  classification,  where  n  is  the  number  of  observations  in  the 
i'th  A  class  and  r  is  the  simole  intraclass    correlation  or  reoeatability.  The 
reason  why  these  formulas  are  only  apr)roxiiTBtions  of  the  maximum  likelihood 
estimates,  rather  than  the  exact  formulas,  is  because  \i  and  the  a^'s  must  be 
known  without  error  to  obtain  the  rraximum  likelihood  estimate.    Since  this  is 
imDOssible  in  practice,  the  estimates  of  \i  and  the  a^'s  are  used  to  obtain  the 
"best"  unbiased  estimate • 


Numerical  Example 

The  same  data  used  for  the  numerical  examnle  in  the  two  previous  sections 
will  be  used  here  by  rearranging  them  as  follows: 


Henderson,  C,  R,    Estimation  of  general,  specific  and  maternal  combining 
abilities  in  crosses  among  inbred  lines  of  swine.    Unoublished  Ph.D.  thesis, 
Iowa  State  College  Library.  199  P.p. 
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Ration  Pig       ^^ire  No. 

No.  No.    12  3 


1 

1 

5 

2 

3 

2 

6 

3 

3 

^ 

_ 

|] 

- 

6 

- 

5 

- 

7 

Siih-.+n  +  al  c! 

'-J 

■a 
J 

I 

i 

2 

o 

0 

h 

2 

3 

8 

\x 

3 

9 

6 

L 

- 

- 

6 

5 

7 

Sub-totals 

25 

27 

Totals 

l6 

30 

Iteans 

U 

6 

5 

1)  Mathematical  model: 

Tijl^  »  M.  +  Si  +  r j  +  eijk 
i  =  1,  2,  3 

3  =  1>  2 

k  =  1,  2,  nij 

where : 

yijk     "the  gain  of  the  k'^-^  barrow  on  the  j'th  ration  by  the  ith  sire, 
^  =  the  overall  mean  with  equal  subclass  frequencies. 
Si  «  effect  of  the  i"*^^  sire, 
=  effect  of  the  ration, 
®ijk  '  random  errors, 

2)  Least-squares  equations: 


|J- 

SI 

S2 

—7; — 

S3 

n. 

— R  

RHM 

p.: 

18 

U 

8 

6 

8 

10 

9h 

si: 

h 

0 

0 

2 

2 

16 

S2: 

8 

0 

8 

0 

3 

h8 

S3- 

6 

0 

0 

6 

1 

30 

ri: 

8 

2 

5 

1 

8 

0 

37 

r2: 

10 

2 

3 

5 

0 

10 

57 

It  will  be  noted  that  a  number  of  dependencies  exist  in  these  equations. 
Fo-^  example,  in  the  a  equation  the  sum  of  the  coefficients  for  %  equals 
the  sum  of  the  coefficients  for  the  r^,  the  overall  number  of  observations. 
Likewise,  the  sum  of  the  right-hand  mgmbers  for  the  Si  and  the  equa- 
tions each  equal  the  right-hand  m.ember  for  the  [l  equation.    Hence,  the 
determinant  of  the  coefficient  matrix  is  zero  and  no  unique  solution  or 
inversion  is  possible  without  imposing  some  restrictions  on  the  equations. 

Imposing  the  restrictions  that  2  s^^  Z  rj  =  Ot 


ji 

tl 

§2 

■  F.HM 

18 

-2 

2 

»2 

Sl- 

10 

6 

h 

-1)1 

32: 

2 

6 

Ih 

6 

18 

^1* 

-2 

h 

6 

18 

-20 

Inverse  of  the  reduced  least-squares  coefficient  matrix  and  constant 
estimates  t 

The  matrix  inverse  to  the  reduced  coefficient  matrix  is  as  follows: 


1^ 

SI 

S2 

n 

.0611)86 

,021818 

-.022160 

.009363 

.0218I48 

.lhL8l9 

-. 06117 h 

-.009363 

-.022160 

-.06ll7li 

.1120h7 

-.026217 

n.- 

.009363 

-.009363 

-.026217 

.O67I1I6 

The  constant  estimates  obtained  bv  multinlving  the  inverse  and  RHW's  of 
the  reduced  matrix  together  are 

p.  =  Li,  8876  S2  =  1.311j6 

^1  =  -.8876  ri  =  -.8090 

Estimates  of  S3  and  r2  are  then  computed  as  follows: 

^  =  -(-.8876  +  1.31h6) 

=  -.1270 
r2  =  -(-.8090)  =  .8090. 

Sums  of  squares  for  the  analysis  of  variance: 

R(|i,si,rj)  =  (Ii.8e76)(9l.)  +  (-.8876) (-Ih)  +  (1.31b6)(l8)  ^  (-.8090)(-20) 
=  511.7036. 
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The  total  uncorrected  sum  of  squares  for  y^-nr  is  llZyf =  568. 
Error  S.Sqs.  =  568  -  511.7036  =  56.296ij  with  no  interaction. 
S.Sqs.  Sires(S)  =  R(vi,Sj^,rj)  -  2 

=  511.7036  -  [i96.0250  =  15.6786 

S.Sqs.  Rations(R)  »  R!(^t,S4,r^)  -  Z  -iJL 

J       i  "i. 

=  511.7036  -  502.0000  -  9.7036 

S.Sqs.  Interaction  (SR)  =  ZZ         -  R(^i,s^,rJ 

ij  "ij 

=  5Ij1.9333  -  511.7036  =  30.2297 
S.Sqs.  Within  SR  Subclasses  =  568  -  5hl.9333 

=  26.0667 

The  analysis  of  variance  is  as  follows: 


Source  of  Variation 

d.f. 

S.Sqs. 1/ 

M.S. 

F 

Sires(S) 

2 

15.6786 

7.8393 

Rations(R) 

1 

9.7036 

9.7036 

SR 

2 

30.2297 

I5.11b8 

6.96-{Hf 

Within  Subclasses 

12 

26.0667 

2.1722 

1/  The  sums  of  squares  for  sires  and  rations  are 
unadjusted  for  the  interaction. 


Since  the  test  of  significance  for  interaction  is  sifnificant  at  the  .01 
level  of  orobabilitv,  the  least-squares  anal'^'"sis  should  be  comoleted  with 
constants  being  fitted  for  the  interaction  effects  as  well  as  the  main 
effects.    When  all  subclasses  are  filled,  the  aporoririate  least-squares 
analysis  can  best  be  completed  for  the  two-wav  classification  by  the 
method  of  weighted  squares  of  means.    A  discussion  and  the  presentation 
of  methods  of  analvsis  when  the  interaction  must  be  considered  is  de- 
ferred until  the  next  section.    The  analvsis  of  these  data  will  proceed 
as  tiiough  no  interaction  effects  existed. 

With  no  interaction  effects  considered  the  analysis  of  variance  is 
as  follows: 


Source  of  Variation 

d.f. 

S.Sqs. 

M.S. 

F 

Sires  (S) 

2 

15.6786 

7.P393 

1.9^  n.s. 

Rations(R) 

1 

9.7036 

9.7036 

2.hl  n.s. 

Frror 

111 

56.296b 

1.0212 
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Standard  errors  and  individual  comoarisons: 
(a)    Standard  errors  of  least  squares  means. 


sjl  -  V  (.0611.86)  ()i.C212) 
=  .50 


S(i+Sl 

_  .  /  r  OAT  1  PA 

=  V   yOolhoo  + 

=  1.00. 

,l)i[i8l9  + 

1,2  a  .021o[!0  jj  {h,vdld) 

=  \A  f.06l)i86  + 
=  .72, 

.1120Li7  + 

(2)  (-.022160 5j  (!!.0212) 

sil+s3 

=  y  [.06lh86  ^ 
=  .39. 

.13h5l8  + 

(2) (.0003121  (1.0212) 

=  yj  [.061I.!86  + 
=  .77 

.067lil6  + 

(2)(. 009363))  (Ii.0212) 

sj+r2  =  /  [70611.86  +  ,067M'6  +  (2) (-.009363)]  (l!.0212) 


(b)    Individual  comparisons  among  the  Sj^. 

(i)    Kean  separation  with  Duncan's  Ktiltiole  Range  Test  (.05  level). 


Com-oarison 

Vi-y. 

/cii+CJJ-2Cij 

Product 
Differences 

Zp>"2 

S2  vs.  S3 

1.7lil6 

2.396 

lj.17 

6.08 

S2  VS.  s^ 

2.2292 

2.297 

5.12 

6.38 

S3  vs.  s^ 

,'i606 

2.116 

.97 

6.08 

These  procedures  are  carried  out  on  these  data  onlv  for  the  purpose 
of  showing  the  comoutstions  involved.    In  oractice,  if  the  F  test  in 
the  analvsis  of  variance  is  non-significant,  it  is  doubtful  that  one 
would  want  to  use  a  mean  ser^aration  Drocedure, 

(ii)    Orthogonal  comparisons  among  the  Sj_. 

The  oomnoutational  orocedures  involved  in  comouting  the 
least-squares  sums  of  squares  foi-  orthogonal  contrasts  among 
a  set  of  constants  were  illustrated  in  the  two  previous 
sections  with  the  numerical  example.     Since  this  procedure  is 
always  the  same  regardless  of  the  other  constants  being  fitted 
in  the  least-squsres  analvsis,  it  will  not  be  shown  here. 


7)    Estimation  of  variance  comDonents. 


The  number  of  degrees  of  freedom  for  sires  and  rations  are  in- 
adequate, of  course,  to  give  accurate  estimates  of  variance  components 
for  sires  or  for  rations.    However,  in  order  to  illustrate  the  computa- 
tional procedures,  which  are  directly  apDlicable  to  analvses  where  the 
number  of  degrees  of  freedom  are  adequate,  the  effects  of  sires  and 
rations  will  be  regarded  as  random  and  estimates  of  a|    and  or  will  be 
obtained. 

The  expectation  of  the  mean  square  for  rations  is  a|  +       a^,  and  the 
exT)ectation  of  the  mean  square  for  sires  is  a|  +  k2a|  when  the  inter- 
action does  not  exist.    Since  the  mean  square  for  error  contains  only  ct| 
the  problem  is  to  compute       and  k2.    As  pointed  out  above,  these  k  values 
mav  be  comuuted  b^r  either  the  direct  method  or  the  indirect  method#  Al- 
though the  indirect  method  is  the  easier  to  use  in  the  two-way  classi- 
fication both  methods  will  be  presented  here.    Extension  of  the  direct 
method  to  more  comolex  analvses  where  it  is  more  useful  than  the  indirect 
method  is  straight-f onward. 

(a)    Direct  method  of  com.t)uting  the  k's. 

The  inverse  matrices  to  the  Z  segments  of  the  complete  matrix 
inverse  (L)  are  as  follows: 

"B.  975050  L.9CC08li 

I.9OOO8L  11.600112 


The  coefficients,  k]_  and  k2,  are  now  comr>uted  as  follows: 
k-L  =  I  (1I,.03327U)  =  7.1il7 

k2  =  ^[8.875050  +  11.600112  -  I  (h.90008L  +  1j.90008L)] 

=  i  (15.675078)  =  5.225 
3 

The  sums  of  squares  for  sires  and  rations  mav  now  be  verified  from 
B'Z~-^B, 

S.Sqs.  Sires  =  (-.8876  1.3lL6)Zs^B 
=  (-1.52L6    10.9002)  B 
=  15.683 
S.Sqs.  Rations  =  (-.8090)Zg^  B 
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r.  Dill  81 9 

L061171 


I1819  -.06ll7h 


061171  .II20L7 


ZS^  =  r.067)il6]'"^    =  riL.83327)j" 


»  (-12.0001)  B 
•=  9.708 

These  check,  within  rounding  errors,  with  the  sums  of  squares  ob 
tained  with  the  indirect  procedure » 

(b)    Indirect  method  of  cormDuting  the  k's. 

Step  1»    Computation  of  the  matrix  inverses  to  the  reduced 
matrices. 

The  inverse  of  the  coefficient  matrix  for  p,  +  s^  is 
jjL+S]^  \i-*'S2  (l+s^ 


y+r2: 


\i+S2'' 


1 

0 

0 

h 

0 

1 

0 

8 

0 

0 

1 

T 

The  inverse  of  the  coefficient  matrix  for  is 


IS 


160 


Step  2.    Computation  of  the  "associated  sums"  matrices,  NN' 
Rations  associated  with  sires: 


*  8  16  12" 
IF   3|i  20 

12    20  26 


Sires  associated  v;ith  rations: 


N  .  .  =  N  N  ' 


2  5  1 
2    3  5 


30^  2A 
24  38 


2  2 
5  3 
1  5 
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4 

8 

6 

4 

o' 

0 

2 

-4 

8 

2 

6 

-4 

lln 


step  3*    CoiTOuting  the  sum  of  products  of  R^J  and  N^^. 
Sires: 

22  =  10.583. 

Rations : 

22  Rij  N.  .  «  7.550. 

.Step  U«    Coitrouting  the  k  values. 

18-10.583      7  ,,7 
^1  '   ~J         ~  7  •''17. 

>  18=L^  =  5.225 

These  k  values  check  exactlv  with  those  computed  by  the 
direct  method.  With  the  two-way  classification  this  in- 
direct method  reduces  to  the  following; 

=  18  -  (^)'-^(^>'  -  (^)'-^(3)^  -  (1)^-^(5)^ 
^  h  8  6 


=  7.1)17, 


k2 


■  2         "  '  8  ■       10  ~J 


=  5.225 

Computing  the  variance  comnonent  estimates. 

The  analysis  of  variance  with  the  expectations  of  the  mean 
squares  is  given  below: 


Source  of  Variation 

d.f.  S.Sqs. 

K.S. 

E(MS) 

Sires(S) 

2  15.6786 

7.8393 

a|+5.225a| 

Rations (R) 

1  9.7036 

9.7036 

a|+7.)il7aj 

Error 

IL.  56.296lj 

L.0212 

9.7036  -  I1.0212 
7.M7 

«  .766 

7.8393  -  I1.0212 
5.225 

=  .731 
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8)    Absorption  of  the  \i  equations. 

(a)    Calculation  of  the  new  or  adjusted  coefficients  for  the  r-j. 

The  calculations  involved  in  the  absorption  of  the  jjl  + 
equations  may  be  obtained  from  the  following  table: 


^-1. 

^1 

^2 

RHM 

+  s^: 

Ii 

2 (.50000) 

2 (.50000) 

16(1-.  00000) 

^1 

-  S2: 

8 

5(. 62500) 

3(. 37500) 

Ii8(6. 00000) 

*  S3: 

6 

1(. 16667) 

5(. 83333) 

30(5.00000) 

The  values  in  parentheses  above  are  the  numbers  just  outside  the 
parentheses  divided  by  ni,,  the  diagonal  coefficient  for  the  |i  +  s^ 

equation.    The  calculation  of  adjustments  which  must  be  irede  in  the 
coefficients  for  the  rj  and  in  the  RHM's  for  the  rj  equations  are  now 
completed  as  follows : 

C(riri)  =  8  -(2) (.50000)  -(5) (.62500)  -(l)(. 16667)  =  3.70833. 

C(r]^r2)  C(r2r3^) 

-  -(2)(, 50000)  -(5)(. 37500)  -(1) (.83333) 

=  -(.50C00)(2)  «(3)(. 62500)  -(5)(. 16667)  «  -3.70833. 

C(r2r2)  =  10  -(2)(.5000C)  -(3)(. 37500)  -(5)(. 83333)  -  3.70835. 

S(ri)  =  37  -(2) (Ii. 00000)  -(5)(6.00000)  -(l)(5.00000) 

»  37  -(.50000)(16)  -(.62500)(h8)  -( .16667)(30)  «  -6.00000. 

S(r2)  -  57  -(2) (h. 00000)  -(3)(6.00000)  -( 5) (5.00000)  =  6.00000. 

The  reduced  equations  after  absorption  of  the  fi  +       equations  are 
as  follows: 


^1 

RHM 

^1* 

3.70833 

-3.70833 

-6.00000 

^2' 

-3.70533 

3.70835 

6.00000 

Except  for  rounding  errors,  the  coefficients  for  r^  sum  to  zero 
by  rows  and  columns  and  the  RHM's  sum  to  zero  by  column, 

(b)    Imposing  the  restriction  that  Z  r-j  =  0,  coirouting  the  r^  and  the 
inverse  of  the  reduced  iratrix.j 


.li9- 


is 


and 


The  remaining  equation  after  the  subtraction  by  rows  and  columns 

lli.8333li  ri  -  -12.00000 
T  -12 .00000 
^  lh.8333U 

«  -.8090 
T2  »  -r^^  »  08090. 


These  are  the  same  values  obtained  for  the  r^  when  the  solution 
to  all  equations  was  obtained  directly. 

The  inverse  of  the  reduced  matrix,  of  only  order  one  in  this 
case,  is 

I0535C  - 

It  will  be  noted  that  this  is  the  same  inverse  element  obtained  for 
this  position  of  the  matrix  inverse  when  the  inversion  of  the  ji,  9^ , 
S2  and       reduced  matrix  was  completed  simultaneously, 

(c)    Back  solution  for  the  {i  +  Sj^  constants. 


0 

0  0 


0  0 
1 


'16  \ 

18 

^30; 

0  0 

1 


0   i  0 
8 

0    0  i 

^ij.ooooN 

6.2022 

L.L607/ 


16.0000 

h9.6l80 
26.76I4O 


(d)    Computation  of  the  variance-covariance  inverse  matrix  and 
maximum  likelihood  estimates  for  the  sire  means. 
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^.067L16"] 
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0  -1 
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h 

c 

0 

fl    0  0^ 

s 

0 

1 

8 

0 

j  0    1  0 

■) 

0 

0 

1 

6. 

V '  ^) 

^1 

h 

0 

0 

i.oobooo 

.000000 

0 

1 

8 

0 

,000000 

1.C33708 

0 

0 

1 

6._ 

.000000 

-.0671)16 

'  .250000 

.000000 

.oooooo' 

.000000 

.12921h  - 

.011236 

^  .nooooo  . 

-.011236 

.196629 

(0 -♦016854  .0ljh9Ui) 


,000000 


The  A  matrix  inverse  aorlies  directlv  to  the  \x  +  s±»    This  metrix 
may  be  transformed,  as  shown  below,  to  give  the  inverse  elements 
which  apply  separately  to  \i  and  the  s-^. 


KAK 


111 


.250000      .000000  .000000 
,000000      .129211  -.011236 
-1    2  -Ij     [.000000    -.011236  476629 


2  -1  -1 
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.250000 

.117978 

.185393 

1 

2 

-1 

,500000 

-.117978 

".165393 

1 

1 

-1 

2 

3 

26q66)i 

-.219101 

3 

\ 

-1 

-1 

'  .061186 

.02l8h8 

-.022160^ 

.021818 

.III1819 

-.061171 

-.022160 

-.C6117U 

.II20L8 

The  transformed  inverse  elements  for  the  r^  column  and  row  are  obtained 
from 


KG 


111 
2  -1  -1 
-1    2  -1 


0 

-.01685L 


■.009363 
-.009363 
-.026217 

The  variance-covariance  inverse  obtained  by  this  indirect  manner  is  given 
in  the  following  table; 


S2 

^1 

.061L86 

.021818 

-.022160 

.009363 

^1- 

.0218I18 

.lLih8l9 

-.06ll7h 

-.009363 

-.022160 

-.061171 

.1120).!  8 

-.026217 

ri: 

.009363 

-.009363 

-.026217 

.O67LI6 

It  will  noted  that  these  inverse  elements  check  within  rounding  error  of 
those  obtained  directlv.    Of  course,  with  only  four  equations  this  in- 
direct method  is  inefficient.    However,  when  there  are  a  large  number  of 
constants  to  be  fitted  in  one  of  the  two  sets,  the  absorption  procedure 
is  extremelv  useful.    In  many  oroblems,  the  inverse  for  the  section  of 
the  inverse  matrix  involving  the  constants  absorbed  is  not  required.  How- 
ever, the  method  of  obtaining  these  inverse  elements  after  absorption  is 
useful  in  some  problems,  particularilv  where  a  large  number  of  constants 
have  been  absorbed  and  tests  of  significance  are  desired  among  only  a 
portion  of  those  depending  on  the  values  obtained  for  p,  +  Sj_,    Also,  only 
the  diagonal  inverse  elements  for  the  (1  +  Si  can  be  computed  by  modif^dng 
this  procedure  slightly.    These  diagonal  inverse  elements  are  required  to 
compute  the  apriroximate  maximum  likelihood  estimates  of  the  effects. 
The  ji  +  Sj_  for  the  numerical  examnle  are 


a  +  1  =  Li.8876  +  — 'P'^  — — ^  (-.8876) 

^       1  ,731  +(o250000)(i-.0212)    ^  •  ' 

=  I1.8876  +  (.L210)(-.8876) 
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(i  +  S2  4 

i  5.65 

A>  73"! 

=  h.8876  -i-  (ei.8G2)(^.l'270) 

=  ]u68 

The  '*best"  estimate  of  theexDected  Derformance  of  additional  rr.ndom  pro- 
geny bv  each  of  these  three  pires,  when  an  equal  number  are  raised  on 
each  of  these  two  rations,,  is  given  bv  the  aoDroxims te  iDa>:imujn  likelihood 
estimates  above.     The  ranking  of  the  (1  +       is  very  likely  to  be  different 
than  that  of  the  (x  +  Sj^  when  more  numbers  are  available  and  when  more 
variability  in  frequencv  occurs  from  subclass-to~subclass.    Hence,  the 
{x  +       are  far  more  useful,  than  are  the  least-squares  means,  the  fl  +  Sj_ 
in  determining  which  anim.als  are  to  be  retained  and  which  are  to  be 
culled  in  a  selection  nrogram* 


h.51 

»731 

u,8876  +  — 7r7ii9mTrinc^^ 

L..8876     (,58a5) (1.311:6) 


aASSIFI CATION  WITH  INTERACTION 

If  data  are  classified  in  onl-v  two  ways  and  it  is  necessary  to  consider 
the  interaction  effects,  sim.plifi€d  methods  are  available  for  comDleting  the 
least-squares  analysis  unless  one  or  more  of  the  subclasses  are  not  filled. 
For  the  special  case  'of  only  two  classes  of  A  or  two  classes  of  B  a  weighting 
method,  described  in  several  books  on  statistical  methods,  provides  a  short- 
cut procedure  for  com.puting  all  suras  of  squares  in  the  analyses  of  variance. 
VJhen  there  are  more  than  two  classes  for  both  A  and  B  and  all  subclasses  are 
filled,  the  weighted  squares  of  means  method  of  analysis  provides  a  short-cut 
procedure  for  com.puting  the  least-squares  sums  of  squares  for  the  main  effects. 
In  this  case,  a  short-cut  procedure  is  also  available  for  computing  the 
variance-covariance  inverse.    This  procedure  will  be  described  and  illus- 
trated with  a  numerical  exanmle  in  thi;^  section. 

When  all  AB  subclasses  are  not  filled  and  the  interaction  must  be  con- 
sidered it  is  necessary  to  use  least-squares  procedures  which  involve  matrix 
arithmetic  in  order  to  obtain  unbiased  estimates  of  the  constants  and  the 
sum^  of  squares  for  tests  of  significance.    This  direct  procedure  also  is 
necessary  if  partial  regression  constants  are  being  fitted  along  with  those 
for  \iy  aj_,  bj,  and  the  interaction  constants,  (ab)j_j,  even  though  all  sub- 
classes are  filled.    In  this  latter  case  however,  the  alternative  procedure 
for  computing  the  inverse  of  the  variance-covariance  matrrx  that  was  pre- 
sented in  the  section  on  the  one-wav  classification  vrith  covariance  may  be 
useful. 
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The  purpose  of  this  section  is  to  present  the  general  least-squares- 
analysis  for  data  classified  in  two  ways  when  constants  for  the  interaction 
effects  must  also  be  fitted.    Problems  arising  in  the  least-squares  procedures 
when  interaction  constants  are  being  fitted  are  presented  in  considerable 
detail* 


Model 

Ine  general  linear  mathematical  model  for  the  two-way  classification  with 
interaction  is 

yijk  ="     +       +  b^  +(ab)ij  +  e±^]^ 
i  =  1,  2,  — ,  p 
3  =  1,  2,  — ,  q 
k  =  1,  2,   ,  n^j 

where : 

^ijk  "  observation  in  the  3"^^  B  class  and  i'th  A  class, 

\i  =  the  overall  mean  with  equal  subclass  numbers, 

aj_  =  effect  of  the  i^^  a  class, 

bj  =  effect  of  the  J'th  B  class, 

(ab)j_j  =  effect  of  the  ij^^  AB  subclass  after  the  average  effects  of  A 
and  B  have  been  removed.    These  are  the  individual  interaction 
effects  expressed  as  a  deviation  from  the  mean  \i, 

®ijk  '  random  errors.    Assumed  to  be  MD(C,a|), 

The  mathematical  model  is  the  same  regardless  of  whether  the  ai  and/or 
the  bj  are  fixed  or  random  effects.    If  both  are  fixed,  the  interaction 
effects  are  also  fixed.    When  all  effects  except  the  errors  are  fixed  the 
model  is  referred  to  «s  Model  I  of  Eisenhart  or  the  fixed  model  and  if  all 
effects  except  [i  are  random  it  is  referred  to  as  Model  II  of  Eisenhart  or  the 
random  model.    The  model  is  regarded  as  mixed  when  one  set  of  effects  (either 
the       or  bj)  is  fixed  and  the  other  is  random.    In  the  mixed  model  the  inter- 
action effects  are  random*    The  least-squares  procedures  required  to  compute 
the  suiTi  of  squares  for  the  analvsis  of  variance  are  the  saire  regardless  of 
whether  a  set  of  effects  is  random  or  fixed.    With  fixed  effects,  the  least- 
squares  means  with  pairwise  or  orthogonal  comparisons  are  often  desired.  On 
the  other  hand,  with  random  effects,  the  investigator  is  interested  in  ob- 
taining unbiased  estimates  of  variance  or  covariance  components.    At  least  a 
portion  of  the  matrix  inverse  to  a  variance-covariance  matrix  is  required  in 
order  to  test  the  significance  of  pairwise  or  orthogonal  comparisons  or  to 
obtain  estimates  of  variance  cr  covariance  components.    Where  feasible,  i.e.. 


if  the  mimber  of  equation?  is  :,-jot  too  large,  the  inverse  of  the  variance- 
covariance  inatrix  for  the  complete  rnon'el  should  be  comDuted* 


Least-Sgnares  Equations 

The  least-squares  equations  for  the  two-way  classification  with  inter- 
action are  given  in  tabular  forir:  below; 


^i 

6. 

J 

(ab)ij 

RHM 

{J,  J      n  „ . 

nij 

Y.. 

a-;  '  nj_^ 

"ij 

Yi. 

"ij 

(ab)-  J  ;  n^j 

0  "13  ' 

This  complete  set  of  equations  contains,  (i)  one  equation  for  p.,  (ii)  one 
equation  for  each  of  the  p  classes  of  A,  (iii)  one  equation  for  each  of  the  q 
classes  of  P,  and  (iv)  one  equation  for  each  of  the  subclasses  of  A  and  B 
which  has  one  or  more  observations.    If  desirable,  however,  constants  for  only 
a  selected  group  of  the  interaction  effects  need  be  fitted.    In  this  case, 
equations  for  onlv  the  selected  groisn  of  subclasses  would  be  included  for  the 
interaction  effects. 


Imposing  Restrictions  on  the  Constants 

A  unique  solution  to  the  least-squares  equations  can  not  be  obtained 
until  they  are  reduced  in  number  to  the  number  of  degrees  of  freedom.  Al- 
though numerous  restrictions  mav  be  imposed  in  order  to  accomplish  this,  the 
restriction  that  the  constants  for  the  main  effects  sum  to  zero  within  a  set 
and  that  the  constants  for  the  (ab);^^  sum  to  zero  over  each  row  and  over  each 
column  is  orobablv  the  most  satisfactory.    The  linear  mathematical  model  it- 
self suggests  this  set  of  restrictions  since  the  effects  of  aj_,  bj,  (ab)j^j, 
and  the  e±^  are  expressed  as  deviations  from  the  mean  u,^ 

Using  the  restriction  which  requires  the  setting  of  one  of  the  constants 
in  each  set  of  main  effects  equal  to  zero  (say,  a^  and  Sq)  suggests  that  the 
(at))4q,  the  (alD)pj  and  the  (a'b)r)q  interaction  constants  could  also  be  set 
equal  to  zero,  thereby,  allowi.ng  the  deletion  of  all  these  equations.  Al- 
though a  unique  solution  txD  the  equations  can  be  obtained  when  this  is  done, 
the  estimates  for  the  constants  are  entirelv  unsatisfactory,  since  they  a  re 

bq  +  aV)„j  -  ahpq 


-  a 

=.  Si 

K 

■"'n  addition  to  the  confounding  of  the  constant  estimates  when  using  these 
latter  restrictions  the  total  reduction  in  sum  of  squares  obtained  from 

p-1  q-1  p-1  q-1 

.\  K  ^.  *  ,\  H       '  ,\  .\  (^)ia  Yij 

1=1  j=l    ^  1=1  3=1 

is  incorrect,  being  biased  upwards.    Also,  the  sums  of  squares  for  the  various 
effects  obtained  from  the  sub-inverse  iretrices  and  the  prime  constant  es- 
timates are  incorrect.    Hence,  the  simple  deletion  of  eouations  to  remove  de- 
nendencies  among  the  least-squares  equations  when  interaction  constants  are 
being  fitted  mav  lead  to  serious  errors. 

By  imposing  the  restrictions  that  Z  a-;  =  Z  B-;  =>  Z(at))-^  ^  »  Z(ab)^  -?  =  0 

on  the  prime  estimates  of  the  constants  -.t  is  possible  to  compute  the  a-j^, 

and  (ab)-j_j.    In  addition,  the  inverse  matrix  can  be  transformed  to  give  the 

inverse  that  would  have  been  obtained  had  the  restrictions  that  Z  a^-  =  Z  £4  = 

i  ^ 

Z  (ab)j^j  =  Z(ab)j_j  =  0  been  imposed  on  the  original  least-squares  equations. 

All  these  manipulations,  however,  require  far  more  work  than  required  in  the 
subtractions  and^additions  within  the  original  equations  to  impose  the  re- 
strictions that  ai  and  bj  sura  to  zero  and  the  (a'b)ij  sum  to  zero  by  rows  and 
columns. 

When  the  restrictions  that  Z  a^  =  Z  £^  =  Z(ab)4  ^  =  Z(a1))i^  =  0    are  im~ 

i  j    ^      i         ^  j 

posed  on  the  least-squares  eouaticns  it  is  necessary  to  carry  out  a  number  of 

subtractions  and  additions  within  the  coefficient  matrix  and  the  right  hand 
members.    The  subtractione  required  within  the  a^  and  b^  equations  are  the 
same  bv  column  and  by  row  as  explained  lureviously.    Within  the  set  of  coeffi- 
cients for  the  (al))i-j  the  subtractions  and  additions  which  may  be  con- 
veniently   chosen  for  each  row  are  as  "^ollows: 


I 


After  these  changes  have  been  made  bv  column  the  same  procedure  is  followed  by 
row  with  the  modified  coefficients  for  the  (ab)ij  equations  and  for  the  RHM's, 
Hence,  the  reduced  RHM's  for  the  reinaining  (ab)i-j  are 

Considerable  manipulation  of  the  coefficients  and  Rffll's  of  the  least- 
squares  equations  are  required  to  iiroose  these  latter  restrictions.  However, 
if  desirable,  the  subtractions  and  additions  required  are  easily  programmed  on 
high  speed  computers  and  when  the  number  of  equations  are  small  they  can  be 
completed  quicklv  on  a  desk  calculator  or  adding  machine.    By  computing  the 
subtractions  and  additions  for  all  elements  that  are  to  remain  in  the  reduced 
matrix  a  check  on  the  computations  is  provided  for  the  off -diagonal  elements 


of  the  coefficient  rr^atrix.  since  the  redrwed  coefficient  mf^trix  rtnist  b 
ST,TTmietricaj.  about  the  main  diagonal*, 


Completing  the  Least- Squares  Analysis 

Any  standaro'  procedure  my  be  used  to  coTnmte  the  matrix  inverse  to  the 
Tariance-covariance  R-atrix  and/or  the  solution  to  the  reduced  set  of  equa- 
tions«    It  is  often  desirable  to  cornpute  the  solution  of  the  equations 
directly  for  at  least  one  RKI^j  and  at  the  same  time  as  the  inverse  matrix  is 
being  computedj  in  order  to  check  on  the  accumulation  of  rounding  errors*  The 
difference  between  the  constant  estimates  obtained  from  direct  solution  of  the 
equations  and, those  obtained  by  raultiol^rlng  the  inverse  matrix  by  the  reduced 
RHM^s  provides  a  measure  of  the  accuimilation  of  rounding  errors.    It  is  de- 
sirable to  have  the  tvio  estimates  of  the  same  constants  agree  to  at  least  four 
significant  digits »    Checking  of  the  off -diagonal  elements  of  the  inverse, 
when  a  method  of  inversion  is  used  which  comDutes  all  elements  of  the  inverse^ 
is  an  undesirable  check  on  the  accumulation  of  rounding  errors  since  the 
rounding  errors  effect  both  off -diagonal  elements  in  much  the  same  manner* 
When  no  complete  confounding  of  effects  remains  in  the  reduced  least-squares 
matrix  and  no  constants  are  being  fitted  for  partial  regressions^  the 
accumulation  of  rounding  errors  is  seldom  a  problem,  when  standard  procedures 
using  seven  or  eight  significant  digits  in  all  calculations  are  used  to  im^ert 
the  matrix^ 

The  inverse  elements  for  the  column  and  row  for  ap  (or  bq)  may  be  ob- 
tained by  adding  the  inverse  elements  for  the       columns  (or  rows)  and  re-» 
versing  the  sign  of  the  siim  as  explained  in  the  first  secticn.    The  same  type 
of  procedure  is  used  to  obtain  the  inverse  elements  for  the  interaction 
columns  and  rov/s  which  were  eliminated^  e.g., 

0-1 

3=1 

q~i 

C^jabiq  ^  (.abiqbj  ^  "^f  c^jsbij 

^     ,  D-1 

i=l 

i=l  i=l 

etc. 

The  sum  of  squares  for  error  in  the  analysis  of  variance  is  computed  from 


The  to-*-,al  reduction  in  sum  of  squares,  Rr(j,,ajL,b^,  (aj)^^! ,  mav  also  be  coin« 

puted  from  ZZ         in  this  case^  since  all  of  the  variability  among  the  "3  sub- 
ij.^ij 

classes  is  accounted  for  by  tue  constants  being  fitted.    Hence,  by  computing 
this  total  reduction  in  both  ways  another  check  is  provided  on  the  accumul- 
ation of  TOunding  errors  during  the  solution  of  the  equations. 

The  su^s  of  squares  f'^r  A,  B,  and  AB  may  be  computed  by  ^he  direct  pro- 
cedure, B'Z-1b,  which  involves  the  segments  of  the  ^natrix  inverse  to  the  var- 
iance-»covariar.:-e  matrix  and  the  constant  eotimates  <  r  from  the  indirect  pre  - 
cedure  involving  differences  in  various  reduct-'ons  in  sums  of  squares.    If  the 
indirect  procedure  is  used  the  solution  to  other  sets  of  equations  is  required 
For  example,  the  sum  of  squares  for  A  (usinj  the  indirect  method)  is 
Rjjijaijbj,  (ab)i.j1|  -  R[|j,,b4,  (ab) .    The  R [|i,b (ab) is  uotained  bv  dejeting 
all  equations  for  the       (by  column  and  row)  from  the  reduced  set  of  least- 
squares  ei'  lations,  solvi.ng  the  r  3maining  equations  and  computing  the  total  re- 
duction in  the  usual  manner. 

W^en  the  number  of  degraes  of  freedom  for  the  interaction  effects  is 
larg3  the  sum  of  rquares  for  interaction  can  usually  be  obtained  more  easily 
from  R|ji,ai»b.j,  (ab)3_-jj  -  R(|i,aj^,b  j)  than  from  B'Z-^B,  even  though  the  com- 
plete inverse  of  the  variance-covariance  matrix  is  availa^-le.    When  the  inter- 
action constants  must  be  litted  along  with  the  main  effects,  the  sums  of 
squares  for  the  main  effects  can  usuallv  be  computed  directly  from  B'Z°'1b  more* 
easily  than  from  differences  in  reductions,    however,  if  all  subclasses  are 
filled  the  sums  of  scuares  for  the  main  effects  can  be  obtained  with  the 
weighted  squares  of  means  procedure  more  easilv  than  with  either  of  the 
methods  considered  above. 

Tl-e  least-square s  means  for  the  classes  of  A  and  the  classes  of  B  are 
+       and  p,  +  oj,  respectively.    The  standard  errors  foi^  ji,  p,  +  Z.^,  ^.nd 

ji  +       are  computed  in  the  same  manner  as  described  in  the  previous  section  ,« 
fr.'^m  the  appropriate  inverse  elem.ents  and  the  estimate  of       from  the  error  ; 
line  cf  the  analysis  of  variance.    However,  when  the  interaction  effects  are 
significant,  the  investigator  is  more  xnteresteo  in  the  AB  subclass  means 
rather  than  the  class  means.    S'^nce  a  least-squares  subclass  mean  is 

s^j  =  Jx  +  a^  >  bj  +  (ab)ij, 
the  s'-andard  error  ra^v  be  computed  from  the  ir^-erse  mairix  and  a|  as  follows-  j 


In  th©  fcwo-way  classificatioa  -when  gill  .w.bcla@gea  are  filled  and  the  inte-r^ 
action  constants  are  fitted,- 'the  standard  error  of  tha  subclass  mean,  s^j, 

reduces  to  ^/  "  1         «    The  l-jug  formula  for  the  standard  error  of  the  two<«way 

subclass  mean  given  above  is  useful  when  all  celli  are  liot  filled  or  when  all 
interaction  constants  are  not  fitted  as  well  as  when  other  constants,  ?ach  as 
pa.rtial  regressions  or  other  main  effects,  must  also  he  fitted  at  the  mme 
time. 

Mean  separation  procedures  can  be  completed  for  the       and  the  in 

the  same  manner  as  described  in  previous  sections.    Procedures  for  obtaining 
sums  of  squares  for  individual  degree-of -f reedom-orthogonal  comparisons  among 
the  a^  or  among  the       are  also  explained  in  previous  sections.    The  same  pro- 
cedures may  be  used  to  partition  all  of  the  variation  among  the  a^,  the 
and  that  due  to  the  ab^^  into  ©ingle  degree«of <-£ reedom«orthogonal  contrasts. 
In  this  case,  the  diagonal  inverse  matrix  involving  the  s^^  would  be  trans- 
formed with  the  appropriate  transformation  matrix.    For  example,  suppose  there 
are  two  classes  of  A  and  three  classes  of  B  and  the  following  single  degree 
of  freedom  orthogonal  contrasts  are  desired  among  the  subclass  means: 


^2  ^3 

«21 

«22 

^23 

¥■ 

1 

1  1 

1 

1 

1 

A 

1 

1  1 

-1 

B 

2 

■°1  -1 

2 

-1 

-I 

0 

1  -1 

C 

1 

-1 

AB 

2 

-1  -1 

-2 

1 

1 

0 

I  -1 

0 

1 

Using  the  rules  discussed  in  a  previous  section,  the  transformation  matrix, 

K,  is 


"4 

4 

4 

4 

4 

4° 

4 

4 

»4 

«4 

»4 

L. 

24 

6 

-3 

-3 

6 

»3 

»3 

0 

6 

-6 

0 

6 

-6 

6 

-3 

•»3 

„6 

3 

3 

0 

6 

-6 

0 

»6 

6 

The  transformed  matrix,  T,  is  then  computed  froni 

&nd  the  constants  for  the  orthogonal  contrasts,  frosi  KB  «  c^,  where  B  is 

a  column  vector  of  the  t . . . 


The  "vd-thin"  subclass  mean  square  is  the  auDrcpriate  error  term  for 
testing  the  significance  of  the  interaction  regardless  cf  whether  the  model  is ' 
fixed,  mixed  or  ran'^om.    Likewise,  this  error  mean  square  is  the  appropriate 
error  term  for  testing  the  significance  of  both  main  ef -^ects  in  the  fixed 
model  and  for  the  random  main  effects  in  the  mixed  model.  There  is  no  exact  test 
of  significance  for  the  fixed  train  effects  in  the  mixed  model  or  for  both  main 
effects  in  the  random  model  when  both  unequal  subclass  frequencies  and  the 
interaccion  exist. 

The  expectation  of  the  miean  squares  for  the  random  model  is  as  follows: 

4  ^  ^l4b 


A 

B 
AB 
Error 


Since  k]_,  k2  and  k^    are  all  •'different  values  when  unequal  subclass  numbers 
exist  (except  when  D=q=2)  even  though  all  subclasses  are  filled,  no  exact  test 
of  significance  existsfor  either  the  a^  or  bj  effects  in  the  random  model. 
However,  this  is  not  a  serious  problem,  since  the  investigator  is  primarily 
interested  in  obtaining  the  best  unbiased  estimates  of  the  variance  com- 
ponents when  all  effects,  except  p,,  are  random,  rather  than  in  tests  of  sig- 
nificance.   He  vri.ll  often  have  a  Driori  evidence  that  the  a;^^  and  bj  effects 
exist. 

The  coefficients  for  cfb  in  the  mean  square  for  AB,  k]_,  and  the  co- 
efficients for  q§  and  af,  k-^  and  k^,  ma.j  be  computed  by  either  the  direct  or 
the  indirect  procedure  described  in  the  previous  section.    The  other  co- 
efficients, k2  and  kj^,  are  computed  by  the  indirect  procedure.    If  every  AB 
subclass  is  filled  all  of  these  coefficients,  except  ki,  can  be  computed  from 
the  weights  used  in  the  weighted  squares  of  means  procedure  as  described  in 
the  next  sub-section. 

The  expectation  of  the  m.ean  s  quares  for  the  mixed  model  with  the  aj_ 
effects  random  and  the  b-^  effects  fixed  is  as  follows: 

E(>S) 

4  ^5^ 

4  ^  ^24h  ^ 
4  "  H4h 

Error  a| 

The  variance  due  to  the  a^,  a|j  is  now  estimated  over  a  fixed  set  of  B  effects 
and  therefore  the  interaction  variance  which  f  ets  into  the  sum  of  squares  for 
A  can  not  be  separatee  from  the  main  effects       A.    Hence,  the  denominator 


A 

B 
AB 
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mean  square  for  testing  the  significance  of  the  A  effects  (over  this  ;et,of 
fixed  B  classes)  is  the  error  mean  square.  Also,  the  estimate  of  is  ob«. 
tained  by  computing  oalv  kc;  in  the  mixed  model. 

Since  k-j_  and  k2  will  differ  when  unequal  subclass  frequencies  exist  no 
exact  test  of  significance  exist  for  the  b^  effects  in  either  the  randora  or 
mixed  model.    The  approximate  test,  in  which  the  mean  square  for  B  is  divided 
by  the  mean  square  for  AB,  decreases  in  desirability  as  the  am.ount  of  unequal 
frequencies  increase  and  as  the  size  of  a|b  increases. 

The  expectation  of  the  mean  squares  for  the  fixed  model  is  identical  with 
that  given  above  for  the  mixed  model  except  that  k2<7ab  disappears  from  the 
E(MS)  for  B,  'Hence,  all  effects  are  tested  for  significance  by  using  the 
error  mean  square  as  the  denominator  for  F« 

With  the  fixed  model  and  missing  subclasses j  it  must  be  realized  that  all 
estimates  of  constants  and  all  tests  of  significance  resulting  from  the  least- 
squares  analysis  are  relative  to  the  population  of  subclasses  actually  in- 
cluded in  the  data.    Generalization  of  the  result?  to  combinations  of  A  and  B 
which  do  not  occur  in  the  data,  of  course,  can  not  be  made*    This  would  be 
true  even  though  the  interaction  effects  for  the  subclasses  available  did  not 
exist  since  no  estimate  of  the  interaction  effects  outsidt  of  the  data  can  be 
obtained. 

VJhen  data  are  analyzed  under  the  mixed  model,  the  absence  of  AB  sub- 
classes from  the  data  creates  a  ve:r^  undesirable     situation  as  far  as  es- 
timation of  variance  components  is  concerned.    In  this  case,  the  variance  due 
to  the  random,  effects,  say  a|,  is  estiireted  over  a  fixed  set  of  effects  and 
the  absence  of  AB  subclasses  means  that  some  of  the  a^  effects  would  not  be 
measured  over  all  of  the  B  fixed  classes.    Hence,  it  is  usually  desirable  to 
eliminate  the  A  classes  which  are  not  observed  with  all  B  classes  (when  the 
are  random  and  the  bj  are  fixed)  or  eliminate  the  B  classes  which  are  not  ob- 
served in  combination  with  all  A  classes  if  the  estimation  of  a|  is  of  primary 
concern.    If  the  nrim:arv  interest  in  the  analvsis  is  to  obtain  unbiased  es- 
timates of      +  bj  with  minimum  error  when  missing  subclasses  exist,  there  is 
no  need  to  eliminate  any  of  the  filled  subclasses  available  since  the  effects 
of  B  are  measured  over  a  random  sample  of  s.±  effects. 


Short-cut  Procedures 

When  data  are  classified  in  only  two  wavs  with  all  subclasses  filled  and 
the  interaction  must  be  considered,  the  weighted  squares  of  means  procedure 

'   can  be  used  to  compute  the  sums  of  squares  for  the  main  effects  which  are  ad- 
justed for  the  interaction  effects,    A  short-cut  procedure  can  also  be  used 
to  compute  the  coefficients  of  variance  components  in  the  expectation  of  the 

I   mean  squares  for  the  main  effects.    Hovjever,  the  weighted  squares  of  m.eans 
procedure  does  not  provide  for  the  computation  of  the  smn  of  squares  for 

j  interaction  (AB)  or  a  short-cut  procedure  for  computing  the  coefficient  for 

'  '-^abj  ^Ij        the  mean  sqiiare  of  AB,    The  interaction  sum  of  squares  is  usually 
computed  by  the  general  indirect  procedure  for  computing  a 
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least-squares  sum  of  squares  in  the  two-way  classificatioii,  i.e., 
S.Sqs — AB  =  Rlii,&±,h^,{ah)i^  -  R(jji,aj_,b 


=  Z2 


■J 


p-l  0-1 

S  ai(Yi.-Y  )  »  Z 
i-1  j»l 


where  \i,  the       and  the  6j  are  the  constants  which  oolve  the  reduced  least" 
squares  equations  when  the  interac'icn  effects  are  om.itted  from  the  model* 
If  k-^  is  required  it  can  usually  best  be  computed  by  the  general  indirect  pr^^ 
cedure  for  comouting  coefficients  of  variance  components,  i.e 


r-p-4+1 


(n.. 


where  r  is  the  number  of  AB  subclasses;  the  R^J  are  the  inverse  elem.ents  of 
the  matrix  inverse  to  the  reduced-least  .squares-coefficient  matrix  when  the 
interaction  constants  are  omatted  from  the  model,  the  N^^  are  elements  of  the 
"associated"  sums  matrix  comnnted  from  M'  as  explained  in  the  previous 
section. 


The  weighted  squares  of  m,eans  procedure  for  comouting  estimates  of  the 
ji  +  a^  and  u  +  bj  the  least-squares  sums  of  squares  for  A  and  B  and  co« 
efficients  of  variance  comnonents  in  the  E(MS)  is  given  below: 


Z  Sj_^ 
3  ^ 


q 


2  s^. 


qs. 
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S.Sos.— B  =  Z  v.b 


V. 


2  v1 


pTq^ 


(v.  - 


v. 


2  v' 
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(v.  - 
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Z  wl 
1  1 


2 


When  all  subclasses  are  filled standard  errors  of  the  and  £j  may  be  coni" 
puted  directly  from  the  weighting  factors  and  the  estiirate  of  the  error  var- 
iance as  follows: 


1/ni^    ®  Wi 


The  standard  error  of  a  subclass  mean,  Sj_jj  is 

and  the  standard  error  of  the  difference  between  any  two  subclass  means^  say, 


4 


Hence,  mean  seoaration  procedures  can  be  applied  easily  to  the  subclass  means, 

If  mean  separation  procedures  are  to  be  applied  to  the  p,  +  aj_j  or  the 
jl  +  Sj,  or  if  single  degree~of -freedom-orthogonal  comparisons  are  desired, 
inverse  elements  from  the  matrix  inverse  to  the  variance-covariance  matrix 
are  required.    When  all  cells  are  filled  the  complete  or  partial  matrix  in- 
verse can  be  obtained  bv  a  short-cut  procedure.    In  effect,  the  diagonal  in- 
verse matrix  for  the  subclasses,  is  transformed  with  the  use  of  the 
sppropriate  transformation  matrix,  K,  to  obtain  the  matrix  inverse  to 
the  comDlete  variance-covariance  matrix,  i.e.,  C-1  =  KD-^K",    The  trans- 
formation matrix,        is  the  functional  relationship  matrix  which  gives  the 
association  of  the  sij,  the  subclass  means,  and  {1,  the  aj_,  the  Sj,  and 
(alD)j[-j.    For  exaroole,  if  p      2  and  q  =  3  the  transformation  matrix  is 
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1  1 
1  -1 

-1  2 
»1  -1 

-1  ~2 
-1  1 


1  1 
-1  -1 
-1  -1 

2  -1 

1  1 

»2  1 


a^,  b4,  and  (ab)i4  constants  in 


where  the  row  coefficients  refsr  tc  the  yi, 
order  and  the  column  coefficients  refer  to  the  subclass  means  arranged  in  the. 
usual  order?  S]_]_,  si2,  ^21f  ^22>  ^^"^  ^23*    '^^^  forirailas  which  give  the 

■     ■      ■  Z(ab)ij. 


relLationship  of 
Z(ab)4  4  =  0  are  as  follows: 
3  ^ 


the  subclass  means  and  the  constants  when  ZS± 
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The  coefficients  for  the  s±^  in  these  fori-nulas  are  the  elements  of  the  trans- 
formation matrix  (K)e    Coefficients  for  the  transformation  matrix  are  easily 
written  and  could  orobably  be  generated  by  a  high  speed  computer  for  almost 
av.y  combination  of  p  and  q.    The  numerator  cc^-f ficients  for  any  (ab)..     row  can 

quickly  be  obtained  from  the  products  of  the  numerator  coefficients  in  ih?.  aj_ 
and  bj  rows* 

Since  D"-^  is  a  diagonal  matrix  the  comnutations  can  be  arranged  in  a 
s-i/stematic  manner,  as  illustrated  in  the  section  on  the  one-way  classification 
with  covariance,  and  onlv  those  inverse  ele.nents  required  need  be  computed. 
After  com-nuting  the  inverse  elements,  mean  seoaration  procedures  and  com- 
putations  required  for  testing  the  significance  of  orthogonal  contrasts  are 
carried  out  as  previous] r  described. 


I 


Absorption  of  the 


a^  Equations 


VJhen.  subclasses  are  missing  in  the  two  wav  classification  with  inter- 
action or  if  the  analysis  includes  the  fitting  of  all  constants  for  the  two  way 
classification  with  interaction  plus  constarts  for  other  effects,  such  as  ma.in 
effects  or  partial  regressions,  absorption  of  the  p  +  ai  equations  into  the 
equations  for  the  remaining  constants  mr-y  be  desirable.    However,  since  con- 
stants are  being  fitted  here  for  the  (ab)^^  effects  the  proportionate  reduction j 
in  size  of  the  matrix  is  not  as  grsat  as  is  likely  to  be  the  case  vjhen  inter» 
action  constants  are  not  being  fitted.    If  the  absorption  process  is  to  be 
utilized  the  m.odel  should  usually  be  arran<7''d  so  that  p>q. 


i 


The  new  coefficients  in  the  eqiations  for  the  bj  and  the  (ab)^^  after 
absorption  of  the  \.i  +  a^  equations  are  coirouted  as  follows: 


C(bib-s)      n..  S 
C(b,b.)-^.^ 


C[bj(ab)i^]  =  C[(ab),^(ab)i;j  «         -  ^ 
C[b.(ab)ij,]  =  c[(ab)i^(ab)ij.]= 
cj(ab)ij(ab)i,j]  =  c[{ab)ij(ab)i.3]  -  0 


The  new  right  hand  members  for  the  bj  and  (ab)j_j  equations  are 


ni. 

After  absorption  of  tim  p,  +       equations y  restrictions  may  be  imposed  that 
Z  Sj  =  Z(ab)j^j  »  Z(ab)ij  »  0  and  the  necessarv  subtractions  completed  to  re- 
move the  dependencies  prior  to  solution  and/or  inversion.    With  estimates  for 
the       and  the  (alD)ij  the  original  equations  for  p,  +       can  be  solved  to  ob- 
tain the  jl  +  aj^.    With  the  segment  of  the  matrix  inverse  for  the  bj  and 
(ab)j^j,  the  inverse  elements  for  the  remaining  segments  of  the  complete  matrix 
inverse  may  be  computed,  if  desired,  with  matrix  multiplication  procedures. 
The  basic  procedure  for  completing  these  calculations  was  presented  in  the 
previous  section  on  the  two-way  classification  without  interaction  and  will  be 
given  in  general  matrix  notation  in  the  next  section. 


Estimation  of  Variance  Components 

When  all  effects  in  the  model  are  random,  except  ^.t,  two  useful  short-cut 
methods  which  provide  unbiased  estimates  of  the  variance  components  will  be 
considered.    Both  of  these  methods  are  due  to  Henderson  (1953)§/  and  are  re- 
ferred to  by  him  as  Method  1  and  Method  3.    Both  methods  are  based  on  least- 
squares  princir)les.    However,  the  estimates  obtained  for  the  variance 


6/  Henderson,       R.  Ibid. 


comDon'^nts  by  the  two  rrethods  will  differ  frv  any  given  set  of  data  and  both 
£  its  of  estimates  will  c'iffer  f rorr  those  obtained  from  the  coimlete  least- 
squares  ariali^sis  as  .'escribed  above.    The  variances  of  estimates  of  variance 
components  from  data  with  -unequal  subclass  frequencies  are  unknuwn. 

However,  intuitivelv  it  seems  that  Method  1  of  Henderson  provides  the 
least  efficient  estimates  and  the  coinDlete  least-squares  analysis  provo^des  the 
most  efficient  estimates.     Hence,  where 'feasible,  the  variance  component  es- 
timates should  be  obtained  froin  the  complete  least-squares  analvsis.    In  many 
instanc-^s,  however,  the  number  of  equations  is    „oo  large  to  attempt    the  com- 
olete  least-squares  anal'^'-sis  when  the  interactior.  inust  be  conside"^ed  and  all 
cells  we  not  filled. 

Method  1  of  Henderson  requires  the  least  labor  of  all  three  methods. 
Sums  of  squares  for  the  two  main  effects,  the  interaction  and  the  within  sub- 
class are  computed  in  the  usual  manner  for  balanced  designs  except  that  ap- 
propriate divisors  are  used  for  the  unequal  numbers.    Coefficients  for  the 
variance  comoonents  in  the  mean  squares  (or  sums  of  squares)  are  then  com- 
Duted  and  the  computed  mean  squares  are  set  equal  to  tlieir  expecta-^ion.  The 
resulting  set  of  simultaneous  equations    is  solved  to  obtain  estimates  o-f  the 
variance  comoonents.    In  the  twc -wav  classification  with  interaction  the  an- 
alysis is  as  follows: 


Source  of 


Vari' tion 

d.f. 

S.Sqs 

iL 
n, , 

A 

p-1 

2  - 

^l'-^7^lb-'^8'^)-''^9^i 

B 

q-1 

3  "-j 

n. , 

^l^^[^ab-''-5^b^^6^J 

AB 

r-.p-q+l 

ij  n.ij 

Z 

"  i  ^  " 

z  -lA  +  ** 

j  n.j  n.. 

^l*^l'^lb^^'^2^-*-k3^l 

i.,.-r    ZZZ  y?.,.  -  Z2  -ii 
ijk  ij  n^j. 


Error 

The  nine  k  values  in  E(I-!S)  are  comouted  as  follows; 


n  -  r-p-q+l 


n. ,  - 

Hi 

1  -^^  •  -  T 

1  "i.  ' 

Znf_.  Zy^ 


7     p-1        ni,       n..  ' 

7     p-1  n,. 

Many  terms  are  common  among  these  formulas  for  the  k's.    Actually,  there  are 
only  five  terms  involving  the  nij,  n^^,  n.j,  and  n..  that  must  be  computed 
from  the  data. 

Method  3  of  Henderson  is  a  least-squares  analysis  in  which  the  sums  of 
squares  and  expectations  of  the  error  and  AB  interaction  mean  squares  are 
computed  in  the  usual  manner.    The  sums  of  squares  for  the  main  effects,  A 
and  B,  are  computed  from 

S.Sqs.— A  »  R(^i,ai,bj)  -  E(^i,bj) 

-  R(ix,ai,b,)  -  Z  _li 
and                               S.Sqs.— B  =  R(iifii,h^)  -  (|i,ai) 

-  R(n,ai,bj)  -  2  — 

rather  than 

S.Sqs.  A  «  R[pi,aj^,bj,(ab)i^  -  R[^t,bJ, (ab)^ J 


-67- 


S.Sqs — B  =  R^,ai,t)j,(ab)ijj  -  R j[i,ai, (ab)ij'j 

as  is  done  when  the  comolete  least-squares  analysis  (or  weighted  squares  of  M 

means  anal^^is,  when  a  ;plicable)  is  rrHde.    The  E(KS)  in  the  analysis  of  var-  fl 

iance  can  then  be  set  up  as  follows  for  th^  random  model:  ■ 


E(>S) 

A         of  +  k[,a|b  +  k^a| 


AB  a|  +  k-La|b 
Error  a? 


The  coefficient  for  in  E(MS)  for  AB  is  obtained  in  the  usual  manner 
for  a  least-squares  analvsis,  as  nreviouslv  descri>-ed«    The  coefficients  for 

and  a|,  ^  and  k^,  mav  be  commted  directly  from  the  inverse  of  the 
square  symmetrical  segments  of  the  matrix  inverse  to  thfe  variance-co- 
variance  matrix  when  the  interaction  effects  have  been  deleted  from  the  model. 
However,  k^  and  k^  can  irore  easily  be  computed  by  the  indirect  method  since 
the  last  reduction  in  sum  of  squares  used  in  comouting  both  A  and  B  reduces  to 
a  "between"  sum  of  squares.    The  ^orirralas  for  computing  kj  and  therefore 
become  ...m 


q-1  i 

The  coefficients  for  CT|b  in  E(MS)  for  A  and  B,  k^  and  kj,  are  computed  by 


thfe  indirect  method  as  follows: 


where  the  R-^^  are  the  inverse  elements  fror  the  matrix  inverse  bo  the  var- 
iance-covariance  nr  trix  for  the  model,  yj^jk  =  ^     at     bj  +  ej  jk,  and  %j 
are  the  elements  of  the  "associated  sums"  iratrix  computed  from  NN'.  | 


Nximeri cal  Examnle 


The  h:/T>othetical  set  of  data  analyzed  in  the  three  previous  sections  to 
illustrate  comriut^tional  procedures  vili  now  be  analyzed  under  the  two-way 
classification  model  with  interaction.    The  general  least-squares  procedures 
that  would  be  required  with  missing  subclasses  will  first  be  given  for  these 
data  followed  by  the  weighted  squares  of  means  procedures  that  are  applicable 
when  all  cells  are  filled.    The  mathematical  model  for  this  anaxvsis  is 

yijk  -  |i     Si  +  r j  +  (sr)ij  +  Bijk 

.    .  i  »  1,  2,  3 

3  -  1,  2 

k  -  1,  2,  n^j 

where 

y^ilk  '         Saii^  o^"  the  kth  barrow  on  the  j'^^^  ration 
by  the  i^^  sire, 
\i  »  the  overall  population  mean  with  equal  subclass 
frequencies, 
si  "  effect  of  the  i'^h  sire, 
»  effect  of  the  jth  ration, 
(sr)j^j  ■  interaction  effects, 
®ijk    *  random  errors. 

General  Least-Squares  Analysis 
1)    Least-squares  equations 


The  least-squares  equations  for  these  data  are  represented  in  tab- 
ular form  below: 


(1  si  S2  S3       r2  ( 

S2')ll 

(3r)i2  ^ 

sr)2i 

(sr)22 

(sr)31 

(sr)32 

RHM 

18 

h 

8 

6 

8  10 

2 

2 

$ 

3 

1 

$ 

9h 

Si- 

h 

h 

0 

0 

2  2 

2 

2 

0 

0 

0 

0 

16 

8 

0 

8 

0 

5  3 

0 

0 

3 

0 

0 

m 

S3: 

6 

0 

0 

6 

1  5 

0 

0 

0 

0 

1 

5 

30 

n' 

8 

2 

T 

8  0 

2 

0 

5 

0 

1 

0 

37 

1*2  • 

10 

2 

3 

5 

0  10 

0 

2 

0 

3 

0 

5 

57 

(sr)ii: 

2 

2 

0 

0 

2  0 

2 

0 

0 

0 

0 

0 

11 

(sr)x2' 

2 

2 

0 

0 

0  2 

0 

2 

0 

0 

0 

0 

(sr)2is 

5 

0 

5 

0 

$  0 

0 

0 

i 

0 

0 

0 

23 

(sr)22S 

3 

0 

3 

0 

0  3 

0 

0 

0 

3 

0 

0 

25 

(sr)3i: 

1 

0 

0 

1 

1  0 

0 

0 

0 

0 

1 

0 

3 

(sr)32: 

5 

0 

0 

0  5 

0 

0 

0 

0 

0 

i 

27 
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2)    Imposing  the  restrictions  that  Xs^  =  Zr-  =  Z(sr).  .,  =  Z(sr)^  ,  =  0 

i         j         i  j 

When  these  restrictions  or  constraints  are  irrroos^d  on  the  constant 
estimates  and  the  appropriate  subtractions  and  additions  are  nade  the 
reduced  least-squares  .equations  aupear  as  follows  when  placed  in  tabular 
form: 


il  S]_ 

S2  Ti  {sr] 

11^^'^>21 

'A 

18  -2 

2-2  h 

6 

SI 

-2  10 

6  h 

-1. 

ah 

S2 

2  6 

111    6  -L 

-2 

18 

^1 

-2  h 

6  18  -2 

2 

-20 

(sr)ii 

h  -h 

-L  -2  10 

6 

30 

(sr)2i 

6  -L 

-2    2  6 

22 

3)    Matrix  inverse  to  the  variance-co\ariance  matrix  and  constant  estimates. 


The  inverse  of  the  6x6  svmmetrical  coefficient  (or  variance-covar- 
iance)  matrix  is  given  in  the  table  below? 


^1 

S2 

^1 

(sr)ii 

(sr)2i 

,075926 

.007L07 

^.03l!'8l 

,018519 

-.018519 

-,029630 

Sit 

.159259 

-.05185'^ 

-.018519 

.018519 

.029630 

.120370 

-.029630 

.029630 

.OO7L07 

ri: 

.075926 

.0071' 07 

031^81 

(sr)ii: 

.159259 

-.051852 

(sr)2i: 

.120370 

The  inverse  elements  to  the  left  of  the  main  diagonal  are  omitted  i 
from  the  table  since  they  would  be  the  same  as  those  to  the  right  of  the 
main  diagonal.    Several  of  the  off-diagonrl  values  in  the  matrix  inverse 
are  of  the  same  magnitude  although  in  some  cases  the 7  differ  in  sign. 
Also,  the  sub-matrix  of  inverse  elements  for  the  xnteraction  constants  is 
the  same  ap  the  sub-matrix  for  the  sire  constants.    These  similarities 
occurred  primarily  bocai'se  of  oni'^'-  two  classes  for  rations  in  thir.  ::et  of 
data  ai:!d  will  not  occur  when  p  and  q  are  each  larger  than  three. 

The  constant  estim.ates  obtained  by  muitipl:\i.ng  the  inverse  matrix  and 
the  RHM's  in  the  reduced  matrix  together  are  as  follows s 

(1  =  h.8889  ?!  =  -.5222 

»  -.8889  (sr)xi  -  2.0222 

S2  =  1.5778  (s''r)2i  =  -1.3liiili 

Estimates  of  the  remaining  constants  are  confuted  as  follows: 
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S3  =  ^(-.8889  +  1.5778)  -  -.6889 

r2  -  »(-.5222)  =  .5222 

(s"r)i2  =  -(2.0222)  =  ™2.0222 

(s?)22  =  -^1.3lihL)  =  1.3l.!i)! 

(sr)^^  «  -(2.0222  -  1.3M'M')  =  -.6778 

(s"r)32  =  -^2.0222  +  1.3M'l)  =  .6778 

Four  decimal  digits  are  carried  ir  the  constant  estiirates  to  avoid  ser- 
ious rounding  errors  in  the  commutation  of*  suites  of  squares.  Obviouslyj 
these  estimates  are  not  accurate  to  four  decimals. 

Sums  of  squares  for  the  analysis  of  variance. 

S.Sqs,— Error  =  |^|^ijk  -  R      ^i^rj,  (sr).^ 

=  568  -  (L.8B89)(9iO  -(-.88^9) (-Ih) 

-(1.5778)(18)  -(-.5222)(-20)  -(2.0222) (30) 
-(-1.3bMi)(22) 

=  26.0652 

When  constants  are  fitted  for  all  degrees  of  freedom  among  tiie  subclasses 
the  error  sum.  of  squares  mav  also  be  computed  from 


With  this  example, 

S.Sqs, 


ZZZy: 


-Error  =  568  -  5M.9333  =  26.0667 


which  checks  (T(«.thin  rounding  error)  with  the  sum  of  squares  obtained  for 
error  using  the  general  computing  method  above.    It  will  be  noted  that 
this  is  the  same  sum  of  squares  that  was  used  for  error  in  the  two-way 
classification  without  interaction  for  testing  the  significance  of  the 
interaction  effects. 

The  sums  of  squares  for  sires  (S),  rations  (R)  and  the  interaction 
(SR)  are  computed  by  the  general  direct  procedure  (B'Z~^B)  as  follows? 

-1 

.159259  -.051852 
..051852  .120370 


S.Sqs.. 


»  [-.8889  1.5778| 
=  [-.8889    1.5778]  ^ 
=  [-1.5281    12.M97]  [^--f 


303395 
11j6096 

5889 
5778J 


3.1^6096 
9.662967 


[-.8889] 
|l.5778j 

t.8889l 
t..5?75j 
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=  21.C01^ 

r  ,         r. 159259  -.051852" 


[10.5393  -6.6289] 


"2,0222] 


=  30.22a5 

ExcPD-  fot>  rounding  errors,  it  will  be  noted  that  the  sum  of  squares  ob- 
tained for  intera-ticn  in  this  manner  is  the  saine  as  that  computed  m  the 
last  section  from 

Y?  . 

ZZ.  -ii  -  R('4*Si,r^)  . 

However,  it  should  also  be  noted  that  the  sums  of  squares  for  S  and  R 
above  are  corsiderablr  different  from  those  obtained  for  these  sources  of 
variation  when  the  :.ntersct.ion  effects  were  disregarded.    Also,  of  coursej 
the  constant  estimates  for  the  Si  and  rj  are  different  in  the  two  an- 
alyses.   Still  another  point  worth  noting  in  the  two  analyses  is  the 
difference  in  the  diagonal  inverse  elements  for  [i^  s-^,  so,  and  ri*  >^hen 
ad:justment  must  be  made  for  the  interaction  effects  these  inverse  el  ements 
are  alwavs  larger  unless  ecual  subclass  frequencies  exist.    The  only 
factor  causing  these  inverse  elements  to  be  larger  and  the  constant  es- 
timates and  the  sums  cf  squares  for  3  and  R  to  differ  is  the  unequal  sub- 
class frequencies.    The  distribution,  and  not  the  magnitude  of  the  inter- 
action effects,  is  the  onlv  factor  involved  in  miaking  these  adjustments. 

When  both  the  Si  and  the       are  fixed  the  anal --sis  cf  variance  may 
be  completed  as  follows s 


Source  of  \rariation 

d.f. 

S.Sqs. 

M.S. 

F 

S 

2 

21.0015 

10.5008 

b.83-"- 

R 

1 

3.5916 

3. 5916 

1.65  n.s. 

SR 

2 

30.22li5 

15.1122 

6.56-"- 

Error 

12 

26.C652 

2.1721 

-^Indicates  significance  at  the  o05  level  of  probability. 

5)    Least-squares  means,  stardax-d  errors  and  individual  comparisons. 

The  least-squares  means,  which  mav  be  of  interest  in  an  analysis 
such  as  this,  are 

j}_  =  [,,9  „  »      The  overall  mean  expected  with  equal  numbers. 

+  s^  =  h.C  Sire  No.  1  mean. 

il  -H  S2  =  6.5  ■ —     Sire  No.  2  mean. 


li.2 

Sire  Mo 

3  mean. 

jj.  +  T"! 

h.U 

Ration 

No,  1  mean 

Mt  +  ^2 

a 

5.1' 

Ration 

No»  2  mean 

+ 

h. 

+ 

5.5 

>  - 

Sire  1 

-  Ration  1 

subclass 

mean. 

ft 

+ 

h 

+ 

+ 

31 

2.5 

_  . 

Sire  1 

-  Ration  2 

subclass 

mean. 

+ 

h 

+ 

h 

+ 

(sr)2i 

m 

h.6 

Sire  2 

-  Ration  1 

subclass 

mean. 

+ 

+ 

(sr)22 

8.3 

Sire  2 

-  Ration  2 

subclass 

meai.. 

(a 

+ 

h 

+ 

(sr')3i 

3 

3.0 

Sire  3 

-  Ration  1 

subclass 

mean. 

h 

+ 

+ 

(sr)32 

3 

5.b 

Sire  3 

-  Ration  2 

subclass 

mean* 

The  standard  error  for  each  of  these  nsans  may  be  computed  from  the 
elements  of  the  inverse  matrix  and  the  estimate  of  a|,  2,1721,  from  the 
analysis  of  variance  in  the  usual  manner  which  has  previously  been 
illustrated.    Pairvd.se  tests  of  significance  among  the  sire  means  or  am.ong 
the  subclass  means  mav  be  comiDleted  with  the  "t"  test  or  Duncan's  Multiple 
Range  Test  with  the  information  now  available  as  have  also  previously  been 
illustrated.    The  method  of  computing  sums  of  squares  for  single  degree- 
of -freedom-orthogonal  contrasts  was  also  described  and  illustrated  in  the 
earlier  sections. 


6)    EstiTTiation  of  variance  components. 


Although  the  number  of  degrees  of  freedom  for  sires  and  rations  are 
wholly  inadequate  to  obtain  accurate  estimates  of  variance  components  for 
these  two  main  effects  or  for  the  interaction,  the  computational  procedures 
required  to  obtain  these  estinetes  will  be  illustrated  with  thJ.s  small  set 
of  data. 

If  the  effects  of  sires  and  rations  are  both  regarded  as  random^ 
the  expectations  of  the  mean  squares  in  the  analysis  of  variance  are  as 
follows : 


d.f. 

s 

P-1 

R 

q-1 

SR 

r-p-q+l 

Error 

n.  .-r 

The  coefficient  for  a|j.  in  the  SR  mean  square,  k^,  and  the  co.=ri_ 
efficients  for  a^,  and  a|,       and  k^,  may  be  computed  by  the  direct  method 


2/  The  effects  of  rations  would  orobably  never  be  regarded  as  random  in  prac- 
tice. This  is  done  here  merely  to  illustrate  variance  component  estimation 
with  the  random  model. 
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from  the  Z"^  matrices  that  x^ere  used  to  compate  the  sums  of  squares, 
These  computations  are  shown  below: 

_  7,303395  +  9.662967  -(|) (3.1h6C96) 


13.820266 
6 


=  2.303 


^3 
^5 


1/.075926     13.17072  , 

"2 —  — r~  =  ^--"^ 

^.303395     9.662967  ~  (|) (3.1h6096) 

12^220266  ^  ^^607 
3 


The  coefficients  for  agj,  in  The  E(KS)  for  R  and  S  are  computed  by  the 


indirect  method  as  follows: 


■1  = 


k2  =  T      -  m^'^Nij) 


18 

-2 

2 

6 

-2 

10 

6 

-h 

-h 

2 

6 

11 

-2 

ii 

-li 

-b 

10 

6 

6 

^-2 

6 

IL 

.071L09    .01192).  -.C2)'25L! 

.1517L2  -.059079 
,108807 


..020326  -.021952 
.020326  .021952 
.032521  -.OOL1878 


.156536 


,OLi878l 
.107317 


v^hen  the  off-diagonals  on  the  left  are  omitted. 


NM' 


0  0 


-7h^ 


\i  S2 


0 
3h 


(sr)TT 

( ST) 

\  '22 

\Srj 

26 

h 

h 

25 

o 

1 

25 

0 

h 

I 

0 

0 

0 

0 

0 

0 

0 

25 

9 

0 

0 

26 

0 

0 

0 

0 

1 

25 

I 

0 

0 

0 

0 

0 

h 

0 

0 

0 

0 

25 

0 

0 

0 

9 

0 

0 

1 

0 

25 

32 


when  the  off -diagonals  on  the  left  are  omitted. 

The  reduced  K^^^  matrix,  after  subtracting  by  rows  and  colurrans,  is 


68    -18     8     2h  ho 
3h    26    -2ii  ~2h 
60    -21  -3 
3k  26 
60 


This  matrix  may  be  obtained 
more  easily  from  the  N' 
matrix  where  0,  1  and  -1 
codes  are  used. 


when  the  left  half  is  again  omitted. 
Hence, 

k2  -  [l8  -  (68)(,071L09)  -  (-18) (.011921)  -  .  .  (60)(. 107317^ 

=  18  -  I5.80li5hli  =  2.195 

The  calculations  involved  in  computing  kj,  by  the  indirect  method 
as  follows:  " 

ki  »  I  (18  -  ZZRiJ%j) 


R  - 


.067116 


18.  -2  h  6 

-2  18  -2  2 

ii  -2  10  6 

6  2  6  111 


.009363    -.009363  -.026217 
.061U86      .021818  -.022160 
.111819  -.061171 


,1120li7 
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*2 

2 

1 

5 

2 

0 

0 

1 

0 

0 

2 

0 

3 

0 

5 

2 

0 

0 

0 

0 

0 

0 

2 

0 

0 

0 

0 

0 

0 

> 

0 

0 

0 

0 

0 

0 

3 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

5 

^2 

(sr)i2 

(sr)2i  ( 

sr)22 

(sr)3-^  (sr)- 

68 

30 

38 

h 

h 

25 

9 

1  25 

30 

0 

u 

0 

25 

0 

1  0 

38 

0 

I 

0 

9 

0  25 

i. 

0 

0 

0 

0  0 

h 

0 

0 

0  0 

25 

0 

0  0 

9 

0  0 

1  0 

25 

Hence, 


68    -8     2!i  ho 

68    -IB  8 
%  26 
60 


=  I  (18  -  13.39327h)  =  I  (L, 606726)  =  2.303 


The  reduced  N^^  matrix^  after  subtracting  by  rows  and  columns,  is 


Use  of  the  indirect  method  to  comnute  k2  and  kj_,  involves  a  great  deal  of 
arithmetic  even  when  the  number  of  classes  for  both  A  and  B  is  small*  A 
shorter  and  more  direct  method  should  be  available  to  com^oute  these  co- 
efficients, Perhaos  one  will  be  discovered  in  the  near  future.  Until 
then,  the  use  of  the  complete  least-squares  analysis  for  estimating  var- 
iance comt)onents  when  interactions  must  be  considered  and  subclasses  are 
missing  will,  no  doubt,  be  limited. 


Estimates  of  the  variance  components  ma-' 
complete  least-squares  analvsis  as  follows? 


Source  of  Variation 
S 
R 
SR 
Error 


now  be  computed  from  the 
E(KS) 


+  2,303  a|r  +  b.607  cy| 


2.303  air 


a|  =  2.172 
=  5.618 


=  -1.657 

r-r|    =  -1.001 


The  variance  of  these  variance  corrir)onent  estimates,  of  course,  is  ex- 
tremely large.    Kanv  iiiore  degrees  of  freedoin  are  needed  for  both  S  and  R 
in  order  to  obtain  reliable  estiniates  of  the  variance  components.  The 
variance  coiriponent  analA'sis  for  these  data  is  presented  merely  to  illus- 
trate the  commtational  -orocedures* 


V'.'eighted  Squares  of  Means  Analysis 

When  all  degrees  of  freedom  among  a  set  of  subclasses  are  partitioned 
into  a  set  of  orthogonal  comparisons,  as  is  done  in  factorial  analyses, 
the  estimates  of  all  constants  are  determined  from  the  subclass  means.  In 
this  case J  weights  are  easily  determined  that  are  useful  for  commuting 
sums  of  squareSj  variances  of  constant  estimates  and  coefficients  of  var- 
iance components.    Steps  required  in  the  application  of  the  weighted 
squares  of  means  procedures  to  the  analvsis  of  the  data  in  the  numerical 
example  will  now  be  given, 

(2)    Comr-Jutation  of  constant  estinates. 


fi.      si  =  -S^i™^  =  h.OOOO 

3.0+5.1 

H  +  S3  =  ^    2       =  ij.2000 


u.  +  r^ 


p 


d  .      =  ii|020t8J333^5. 1.000  ,  5.1,111 


3 

2 

h 

=  luOOOO 

-  ['.8389  =  -.8889 

h 

=  6.1667 

-  [1.8F89  =  1.5778 

h 

=  li.2000 

-  I1.8889  =  -.6889 
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=  1.3667  -  i.,8889  -  -.5222 
=  -  I... 8889  -  .5222 


Coirrputation  of  weights, 

«2 


2  l/nij  p7^ 


w-i^  =        »  4,0000  v.  =  -JL.  =  5.2911 


W2  =  ~L  =  7.5000  V2  '  YTT  '  ^''^^^'^ 

5  3  2  3  I 


=  3.3333 


Cbmputation  of  sums  of  squares. 

S.Sqs.  Error  »  Witliin  Subclass  S.Sqs.  =  26.0067 

S.Sqs.  SR  »  30.22h5 

The  sum  of  squares  for  interaction  must  be  computed  from  the  least- 
squares  analysis,  i.e.,  B'Z"^B  or  R[ii.,ai,b j,  (ab)i^ -  R(|jj,ai,b j) . 

S.Sqs. — s  «  (h.0000)(-.8889)^  -s-  (7.5000)  (x.5778)2  +(3.3333X-.6889)^ 

-  C<^-0000)(°»8^89)  +(7.500Q)(1.5778)  -^(3.3333)(-»6869)]  ^ 
h.OOOO  +  7.5000  +  3.3333 

=  21.0013 

S.Sqs.— R  »  (5.29lil)(-. 5222)  ^  +(8.7097) ( .5222f  i 

[(5.29lil)(-.52?2)  ■»-(8.7097)(.5222^  ^  i 
5.29hl  +  8.7097 

=  3.5916 

It  will  be  noted  that  these  sums  of  squares  for  R  and  S  agree  within 
rounding  errors  with  those  obtained  by  --Me  general  least-squares 
procedure,  B'Z-^B. 

Computation  of  k's. 

As  pointed  out  previously,  ki  rmist  be  computed  by  the  general 
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1/ 


least-squares  procedure  even  though  all  sv.bclasses  are  fillet^  All 
other  k's  n>ay  be  computed  from  the  weights  as  ■fo  lows: 


k2  = 


TITT 


frCoQM+S  7nQ"^  (5.29Ul)^^(e. 7097)^1 
ry  [(^-^911^8.7090  -  -—^^^^-^^—]^ 


1  (1L.0038  -  ig^) 


(li>0000)^  ^-(7.5000)^  -^(3.3333)^ 
1)1.3333 


3 

=  i  (6.58510  =  2.195 
3 

'k3  =  i  (6.585L)  =  6.585 

»  i  (9.2135)  =  2.303 

=  I  (9.2135)  =  Ii.607 

It  will  be  noted  that  these  k  values  (k2,k^,k|^,,  and  k^)computed  in  this 
manner  agree  with  those  obtained  bv  the  general  least-squares  o-^o- 
cedure.    VJhen  all  subclasses  are  filled 

k2  =  -  ko 

and  k^  =  i  k^  . 

Short-cut  Procedure  of  Corrrouting  the  l^atrix  Inverse 

^^en  all  degrees  of  freedom  among  a  set  of  subclasses  is  par- 
titioned into  a  set  of  orthogonal  co^rparisons,  the  complete  matrix 
inverse  to  the  variance-covariance  matrix  can  be  com.r)uted  from 

C-1  =  KD-^K' 

where  K  is  the  transformation  matrix  and  D-^  is  the  diagonal  inverse 
matrix  for  the  set  of  subclasses.  In  the  oresent  problem  these  com- 
putations are  simolified  with  the  aid  of  the  following  table: 
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"Constant 
Estirnates 


sii 

"l 

2 

-1 

1 

2 

»1 

5.5000 

.500000 

h.S889 

S12 

1 

2 

-1 

-1 

-2 

1 

2 e 5000 

. 500000 

-.8889 

(1) 
S22  D 

—J. 

c 

T 

X 

-1 

LI  •  \J\,Kj\j 

,200000 

1.5778 

1 

-1 

2 

-1 

1 

-2 

8.3333 

.333333 

-.5222 

S31 

1 

-1 

.  T 

1 

-1 

-1 

3.0000 

1.000000 

2.0222 

S32 

1 

-1 

-1 

-1 

1 

1 

5.booo 

.2000CO 

The  transr>ose  of  the  trans^'ormation  matrix  (K')  appears  on  the  l^-ft 
in  the  table.    The  valves  in  the  D""-  column  are  the  reciorocals  of 
the  s'lbciass  numbers.    The  constant  estimates  are  comnuted  from 

il  =  I  (5.5000  +  2.5000    +  i-  +  5.I1OOO)  =  )!.8e89 
SI  =1  [(2)(5.5000)+  (2)(2.5000)-  (l)(h.6000  ™  (l)(5.bOOO)] 


S89 


etc. 


The  inverse  elements  of  the  matrix  invei se  to  the  variance-covariance 
matrix  may  be  co  ^uted  one  at  a  time  ar  follows; 


1 

35 


^(1)^^500000)  +(1)  ^(.500000)  +  +  +  (l)^(.20uOOO)J 


=  .075926 


C^^S1  =2-  [(1)  (2)  (.500000)  +(1)  (2)  (.500000)  +  +  +  (1)(-1)(. 200000)) 


3^ 
.0071107 


-^M-S2  = 


[(1) (-!)(. 500000)  +(!)(-!) (.500000)  +  +  +  (!)(-!)(. 200000)] 
-.031b8l 


(.sr2isr2l  jj-D^f  .500OOO)  +(1)  ^(  .500000)  +  +  +  ri)  2(  .200OOO)] 


,120370 


Application  of  Method  1  of  Henderson 

When  this  m.ethod  is  used  to  obtain  estimates  of  the  variance  com^oonents, 
the  sum.s  of  squares  for  the  analysis  of  variance  aie  computed  as  follows: 
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s  -^Q-.    s  -  iMl!  +  MH  ^  LI2)1  ml 

"  h  8  6  18 

-  502  »  h 90. 8889  =  11.1111 

^  8  10  18 

=  196.0250  -  1j90.8889  =  5.1361 

(11)^      (5)^  (27)^ 
S.Sqs.  SR    =  +  +  4  +  „  502  -  496.0250  +  490.8889 

=  5M.9333  -  507.1361  =  31.7972 
S.Sqs.— Error    -  568  ~  511.9333  =  26.0667 


analvsis  of  variance 
Source  of  Variation 

is  as 
d.f. 

follows  J 
S.Sqs. 

Mill 

E(MS) 

S 

2 

11.1111 

+  kpal 

R 

1 

5.1361 

5.1361 

4 

+  k),a|j.  +  k^o^ 

+  k60| 

SR 

2 

31.7972 

17.3986 

+  k-j^alr  +  ^2'^ 

+  k^c4 

Error 

12 

26.0667 

2.1722 

4 

The  terms  required  for  the  corrontation  of  the  k  values  for  E(MS)  are  cornputed 
as  follows? 

,  iJ  ,  (2)M2);  ^  (5)!i(3)!  .  (l)M5)!  =  10.5833 
i    "i.  I4  8  ^ 


i_ 

Z 


Zn- . 

i^  ^  (2)M5)Ml)^  ^  (2)M3)M5)^  .  ,  .^00 
J    n.j  8  10 


2 


n..  18 


n..  18 


.2 


ll!i£  =  (2)"-^(2)M5)"^(3)"-h(1)"-h(5)"  =  3.7778 
n..  18 


-81- 


The  nine  k's  are  now  cornnuted  as  follows: 


k-L  =  i  (18  -  10.5833  -  7.5500  +  3.7778)  =  1<,8222 


k2  =  I  (9.1111  -  10.5833)  =  -.7361 


k.  =  i  {SJMu  -  7.5500)  =  -.5528 
J)  2 


\=  \  (7.5500  -  3.7778)  =  3.7722 
k^  =  i  (18  -  9.1111)=  8.8889 
k^  =  i  (7.5500  -  6.[i)jli/i)  =  1.1056 
ky  =  I  (10.5P33  -  3.7778)  =  3.)i028 
k8  =  ^  (10.5833  -  9.1111)  =  .7361 
(18  -  bJxhhh)  =  5.7778 


k9  =1 

'^Hien  the  computed  mean  squares  are  set  equal  to  the  expected  mean  squares  and 
the  resulting  equations  are  solved,  the  following  estimates  for  the  variance 
components  are  obtained : 

o%  =  2.172  =  -2.087 

a|r  -  6.593       a|  =  -3.031 

The  estimates  of  a|j,,       and  a|  do  not  agree  with  those  obtained  from  the  com- 
plete least-squares  analvsis.    Although  these  estimates  probablv  have  larger 
sampling  errors  than  those  obtained  with  the  complete  least-squares  analysis 
thev  are  unbiased  provided  all  effects  in  the  model,  except       can  be  re- 
garded as  random. 


Application  of  Method  3  of  Henderson 

With  this  method  the  sums  of  squares  for  error  and  the  interaction  re- 
main the  same  as  with  the  complete  least-squares  anali'-sis.    Also  the  co- 
efficient for  a|r  in  the  expectation  of  the  interaction  rriean  square,  ki,  re- 
mains the  bdme.    The  sums  of  squares  for  S  an^  R  are  computed  as  follows 
under  this  method : 


S.Sqs.— S  =  R(M.,Si,rj)  -  R(|i,rj) 

=  511.7036  ^  1.96.0250  =  15.6786 


«  511.7036  -  502.0000  =  9.7036 


It  should  be  noted  that  these  are  the  same  sums  of  squares  that  were  computed 
for  S  and  R  when  the  interaction  was  assumed  to  be  non-existent.    The  co- 
efficients for  a|  and  o^  in  the  expectation  of  the  mean  squares  for  S  and  R 
(k^  and  k^)  resoectivelvj,  also  remain  the  same  as  when  interaction  was  dis- 
regarded.   The  coefficients  for  cTq-^  in  the  expectations  of  the  mean  squares 
for  S  and  R,  k^'^  and  k^  are  coiriouted  as  follows: 


k'  - 


2  10 


18-2  2-2 
10     6  h 

Ik  6 

18 


.061L.86    .0218I18    -.022160  .009363 
all  L  81 9    -.06ll7!i  -.009363 
.1120|j7  -.026217 
.0671(16 


5  3 

0  0 


0  3 


68  8 
8 


31    26  30 
0     0  L 
0  25 
26  1 
30 


3h 


VJhen  the  aooronriate  subtracticns  and  additions  are  made  bv  column  and  row  N 
reduces  to: 
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68    -18     8  -8 

3h    26  2h 
60  ho 
68 

Hence, 

k«  =  13.3933  -  10.5833  =  2.P10 
^  -  I  (13.3933  -  7.5500)  =  2.922 

The  analvsis  of  variance  i-alth  the  eroectaticns  of  the  mean  squares  can  new  be 
set  UP  as  follows: 

Source  of  Variation    d.f « 
S  2 

R  1 

SR  2 

Error  12 

Estimates  of  the  variance  components  ••'^hen  ccmnuted  bv  Method  3  of  Henderson 
are  as  follows: 

o|  -  2.172  =  -1.111.1 

afj.  =  5. 61 J  a|  =  -2.058 

Three  methods  of  commuting  unbiased  estimates  of  variance  comDonents 
under  the  random  mode]  have  been  illustrated  with  thio  small  numerical  ex- 
ample.   Presumably,  estimates  of  variance  components  obtained  from  a  complete 
least-squares  a:ialysis  have  s-^aller  sar-nling  errors  than  those  obtained  with 
either  Metnods  1  o:^  3  of  Henderson      However,  short-cut  procedures  are  still 
needed  before  either  the  com^olete  least-squares  anolvsis  or  Method  3  of  Hen- 
derson will  be  feasible  for  many  sets  of  data. 


S.Cqs  M.S. 
15.6786    7.8393    a|  + 

9.7036  ".7036  a|  + 
30.2297  15.1118  + 
26.0667    2.1722  a| 


2,922  a|j.  +  5.2^:5  a| 
2»eiO  alj,  +  7.117  o'^ 
2.;03  a|r 


KQLTIPLE  Am  NESTED  CLASSIFICATIONS 


Most  sets  of  animal  breeding  data  wi.th  disproportionate  subclass  numbers 
are  classified  in  more  than  twcj-ways*    Often,  this  is  also  true  with  other 
types  of  data  where  unequal  subclass  numbers  exist.    Many  of  the  computational 
procedures  presented  in  tho  preYious  sections  are  directly  applicable  to  any 
least-squares  analysis «    On  the  other  hand,  special  techniques  are  often  re- 
quired to  complete  least-squares  analyses  which  inifoli^e  multiple  classifi- 
cations and  the  combination  of  multiple  and  nested  classifications®    In  this 
section  will  be  presented  so5v>e  of  these  special  teclmiques  and  short'-cuts  and 
a  computational  example  which  Lncludes  multiple  classificationSj^  a  nested 
classification,  interaction,  and  two  partial,  regressions  in  one  analysis.  In 
addition,  the  general  j,east-squares  analysis  is  presented  in  matrix  notation^ 


Models 

Although  there  are  many  specific  designs  for  which  the  linear  mathematical 
models  are  standard^  the  most  appropriate  for  use  in  the  analysis  of  any  given 
set  of  data  may  differ  considerably  from  xhe  standard  models*    Since  the 
accurateness  and  validity  of  conclusions  drawn  from  any  analysis  arc  lilghly  de- 
pendent upon  how  accurately  the  selected  mathematical  model  describes  the 
biologjr  involved,  the  importance  of  selecting  the  appropriate  model  can  not  be 
overemphasiaeda    On  the  other  hand,  it  is  doubtful  that  all  assumptions 
necessarjr  for  tests  of  significance  ana  estimation  procedures  to  be  strictly 
valid^  from  a  mathematical  viewpoint^  mil  evei  be  entirely  satisfied*  Never- 
theless, every  effort  should  be  made  to  select  the  model  which  does  allow  the 
assumptions  required,  with  respect  to  homogeneity  of  variance  and  independence 
of  errors,  to  be  most  nearly  satisfied*    At  the  same  time,  however,  the  model 
should  not  be  so  complex  that  the  analysis  can  not  be  completed  at  a  reasonable 
cost. 

When  unequal  subclass  numbers  exist  and  constants  are  fitted  for  a  set  of 
effects  (either  main  effects,  interaction  effects  or  regression)  which  really 
has  no  influence  on  the  variability  in  y,  the  constants  for  other  sets  of 
effects  that  are  estLoated  at  the  same  time  and  the  tests  of  significance  for 
such  effects,  although  unbiased,  may  be  very  inefficient.    Hence,  in  the 
selection  of  the  model  for  the  analysis  it  is  import^ant  to  exclude  from  the 
model  sets  of  effects  which  are  known  not  to  effect  the  variability.    At  the 
same  tima^  however,  it  is  important  to  include  in  the  model  all  effects  which 
really  do  effect  the  variability  of  y  in  order  that  the  estimates  obtained  in 
the  analysis  will  be  unbiased*    This  places  the  investigator  in  somewhat  of  a 
quandary,  A  good  rule  to  follow  is  to  place  a  set  of  effects  in  the  model  when 
some  doubt  exists  concerning  whether  such  effects  are  really  zero  or  not. 
Methods  are  available  (and  will  be  presented  in  this  section)  for  deleting  a 
set  of  effects  after  it  has  been  found  that  the  constants  for  the  effects  with- 
in the  set  probably  equal  zero. 


The  specific  model  for  consideration  at  prest3nt  is 
^ijkl  =  a  +  ai  +  bij  +  ck  +  (ac)ik  +  dDijkl  +  gGijM.  +  ©ijkl  (^) 
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i  =  1,  2,  ...J  p        1  =  1,  2,  nijj^ 

j  =  1,  2,  qj_       s  =  Ho,  of  AB  subclasses 

k  =  1,  2,  r        t  »  No,»  of  AG  subclasses 

where? 

yijkl  "  "^^^  1     observation  jji  tne  k"*'^  C  class  and  in  the  j'''^  B  class 
within  the  i^-h  a  class  e 
a  "  the  population  mean  when  equal  fiBquencies  exist  in  all  sub« 
classes  and  =  Gijkl  ~  ^* 

aj^  =  effect  of  the  1"^^  A  class  ^ 
b^j  =  effect  of  the        B  class  withi.-.  the  i"^^  A  class^ 
=  effect  of  the  k"*^^  G  class, 
d  "  partial  regression  of  the  Yij^l       "^^^  %3icl» 
I^ijkl  '  ^  independent  continuous  variable* 

g  =  partial  regression  of  the  yijkl       '■^^^  ^ijkl* 
^ijkl  *  ^  second  independent  con-^inuous  variable » 
eijkl  -  random  errors  which  are  assuitied  to  be  NID(0,o|),  The 

normality  asstumption  is  required  to  validate  teste  of  signi- 
ficance. 

Tie  population  mean  when  equal  frequencies  exist  is 

[x=a  +  dD  +  gG 

where  D  and  G  are  the  means  ei  the  Dij^l  and  the  '^-ijkl«    However,  the  popu- 
lation mean  for  any  combination  of  values  for  D  and  G  can  be  computed  if  a,  d, 
and  g  are  knotm. 

The  model  (5)  does  not  include  a  term  for  the  interaction  of  t-he  bij  with 
the  Ck,  (bc)ijk*  wliich  could  exist.    It  is  assumed  here  that  a  priori  infor~ 
mation  cleaily  indicates  that  the  (bc)ij}^  effects  equal  zero.    However,  in  the 
completion  of  the  least-squares  erialysis  under  model  (5)  a  procedure  will  be 
given  for  testing  the  significance  of  the  (bc)ijk  interaction  affects.    If  the 
a  priori  information  was  dncorrect  arid  the  interaction  (BCsA)  is  significant 
the  estimat  3  of  constants  for  the  other  effects  are  biased. 

The  procedures  to  be  folloi^ed  in  the  analysis  of  a  set  cf  data  under  modelj 
(5)  depend  on  several  factors.    If  all  effects  ir  the  models  except  the  random 
errors,  are  regarded  as  fixed  a  complete  least-squares  analysis  is  preferred. 
In  this  case,  the  inverse  of  the  complete  variarice-covariance  matrix  would  be 
required  if  individual  comparisons  are  to  be  made  among  least-squares  means. 
Also,  ±1  all  effects  are  fixed  the  number  of  independent  least-squares  equat- 
ions is  not  likely  to  be  large  and  therefore^  the  complete  sjialysis  is  not  too 
difficult  to  accomplish.    However^  the  effects  for  a  nested  classification, 
such  as  bj_jj  usually  represent  a  random  sample  of  those  effects  and  not  a 

selected  group.    In  this  case,  the  niimber  of  equations  is  likely  to  be  large 
and  direct'  inversion  of  all  independent  equa'^.ions  may  be  impractical.    The  mere 
fact  that  a  large  numbei  of  original  least-squares  equations  exist  does  not 
prohibit  a  least-squares  analysis  from  being  completed.    Short-cut  methods  are 
available  whicli  allow  the  investigator  to  obtain  unbiased  estimates,  and  in 
many  cases  the  most  efficient  estimates,  of  all  constants  and  most  tests  of 
significance  desired  even  though  the  number  of  equations  is  large. 


Oftentimes  when  model  (5)  is  appropriate  both  p  aiid  r  are  small  but  the 
are  large  &    When  this  is  true  the  equations  for  a  +       +  b^-i  should  first  be 
absorbed  into  the  equations  for  Cj^,  (^^)±k>  ^»         S  e^ren  though  the  inverse  of 
the  complete  variance-covariance  matrix  is  desired.    In  effect,  the  inverse  of 
the  complete  coefficient  matrix  is  then  obtained  more  easily  by  a  matrix- 
partitioning  procedure  than  if  the  complete  matx^  inverse  is  obtained  by  di- 
rect inversion^    In  order  to  present  these  procedures  (as  well  as  others)  in 
general  terms  which  apply  to  any  least-squares  analysis,  model  (5)  will  be 
given  in  vector  notation  and  then  associated  with  general  matrix  notation. 
Model  (5)  in  vector  notation  may  be  written  as  follows: 

y  =  a+  A  +  B  +  C+AC  +  dD+gG  +  e 

where 

nil 

1112 


Pllll  \ 

M 

^1112  ^ 

a 

^1 

y  = 

» 

+ 

.  + 

« 

y'pqrnp.jr  / 

s 

1^1111  \  /®1111 
S^lllf    \  Q1112 


A  general  fonri  for  any  model  is 
y  =  XB^  +  ZB2  +  e  «  < 
If  p  =  2,  q^  =  3  for  all  i,  r  = 
under  model  (5) 


pqrnpqr 


2  and  t^±^]^  =  1  in  all  cases  for  an  analysis 


and 


+  ag  +  b22 
a  +  a2  b22 
a  +  a2  +  b23. 
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when  the  constants  for  a,  a^,  and  h±^  are  combined  into  a  subclass  constanto 
This  would  be  necessary  iji  order  to  absorb  all  equations  for  the  a+ai+bij«  A'. 


B2 


Constants  to  be  fitted  have  been  arbitrarily  divided  into  two  sets.    The  B 


of  constants  includes  only  those  constants  that  can  be  conveniently  absorbed,  ^t  I 
or  a  and  all  other  constants  which  represent  the  class  or  subclasses  effects » 
The  X  matrix  is  therefore  composed  of  only  zeros  and  ones  giving  the  class  or 
subclass  within  which  each  of  the  observations  falle  The  B2  set  of  constants 
includes  all  constants  to  be  fitted  other  than  those  to  be  absorbed.  The  need 
for  grouping  all  constants  to  be  fitted  into  these  two  sets  will  become  clear 
as  the  absorp'Aon  process  and  completion  of  the  least»squares  analysis  is  pre' 
sented  in  general  matrix  notation. 


Least-Squares  Equations 
The  leas*.-squares  equations  for  model  (5)  are^  given  in  tabular  form  below: 
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a 

(ac)ik 

d 

g 

ni.k 

0 

"i*k 

Gi.« 

Yi.. 

Hi* 

0 

0  -'-J® 

J. 

Yii. 

0 

0  *  *k 

K 

G» 

A. 

Y.«k 

(ac)ij,t 

0 

O^i.k 

Di.k 

•^iok 

Yi.k 

ds 

ZZZZDiikl 
ijkl 

ZZZZDijkiGijkl 
ijkl 

ZZZZGi^kl 
ijkl 

ZZZZDijkiyijkl 
ijkl. 

ZZZZGijkiyijkl 
ijkl 

Elemants  to  the  left  of  the  main  diagonal  in  the  coefficient  matrix  have  been 
onrLtted  since  they  would  be  the  same  as  those  given  on  the  right  of  the  main 
diagonal. 

Before  direct  inversion  of  the  least-squares  coefficient  matrix  can  be 
made  or  a  unique  solution  of  the  equations  can  be  obtained  it  is  necessary  to 
impose  certain  restrictions »    An  appropriate  set  of  restrictions  is  that  ?  » 

^  ^-^  ^  ^ 

Z  b^j  =«  Z  ck  "  2(ac)i]^  =  2(ac)ik  «  0.  The  subtractions  and  additions  required 
j  k  3.  k 

among  the  coefficients  and  RHM's  in  the  least-squares  equations  when  these  re- 
strictions aje  imposed  have  previously  been  described  except  fgr  the  restrj„c- 
tion  that  Z  bij  =  0»    Wten  this  restriction  is  imposed  on  the  bij  the 

coefficients  for  each  of  the  h^q^  are  subtracted  from  the  other  coefficients 
for  bij,  but  only  within  the  i"^^  A  class  of  effects,  i»e«,  the  coefficients  for 
the  last  bij  in  each  A  class  are  subtracted  from  the  coefficients  for  the  other 
bij  within  the  corresponding  A  class ♦    After  this  subtraction  by  columns  the 
same  type  of  subtraction  is  completed  by  rows,  in  \*dch  case  the  RHM's  are  also 
involved. 

If  the  total  number  of  equations  for  a,  ai  and  bij  is  large  the  inverse  of 

the  complete  variance-covariance  matrix  and  the  estimates  of  a,  ai  and  bij  can 

often  be  obtained  most  easily  by  an  indirect  but  general  method.    The  least- 
squares  equations  for  the  general  model  (6)  in  matrix  notation  are 


where,  for  the  present  example,  D  is  a  diagonal  matrix  of  the  nj      of  order 
s  X  S5  N  is  the  matrix  of  coefficients  for  cj,,  (ac)ii^,  d  and  g  in  the  a+ai+bij 
equations  of  order  s  X  (r+t+2)5  N'  is  the  tragspose^of  N  (rows  and  columns  re- 
versed) j  S  is  the  matrix  of  coefficients  for  cj^,  (ac)ij^,  d  and  g  in  the  cj^-, 
(ac)^,  d  and  g  equations  of  order  (r+t+2)  X  (r+t+2)j  Y^^  is  the  matrix  of  RilM's 
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far  the  a  +       +  b^j  equations,  the  of  order  s  X  u  with  u  being  the  mm-^ 

ber  of  dependent  variables |  Y2  is  the  matrix  of  RHM's  for  the  oti^r  equations 
of  order  (r+t+2)  X  u« 

Comput/ation  of  the  Matrix  Imrerse  and  Estimates  of  Constants 

Programs  are  generally  available  on  high  speed  electronic  computers  for 
inverting  matrices  directly  -when  there  are  no  more  than  30  or  iiO  equations.  A 
few  special  programs  are  available,  for  some  computers,  which  -will  invert 
matrices  of  considerably  larger  size.    However,  the  cost  of  computing  the  in™ 
verse  matrix  directly  when  there  are  more  than  about  kO  equations  ijs  usually  verj 
expensive.    With  large  least-squares  matrices  it  is  often  best  to  conqjute  the 
inverse  matrix  by  an  indirect  procedure^/   Suppose  the  matrix  inverse  of  the 
variance-covariance  matrix  under  model  (6)  is  partitioned  as  follows: 


r  mi 
I' 


With  this  partitioning  of  the  inverse  it  can  be  shown  that 

c  =  (s  -  n«D"1n)-i 

G  =  -  D"1nC 

A  -  D-1  (I  -  NG« ) 

where  I  is  the  identity  or  imit  matrix.    It  must  be  realized,  of  course,  that 
restrictions  must  be  imposed  on  the  S  -  N*D°1n  equations  before  the  inverse  C 
can  be  obtained*    The  changes  made  in  the  S  -  N'D"1n  equations  as  a  result  of 
restrictions  inposed,  must  also  be  made  in  N  or  the  additional  rows  and  col" 
umns  must  be  added  to  C  before  computing  G  and  Ae 

Although  the  indirect  procedure  described  above  for  computing  an  inverse 
is  geaeral  and  can  be  applied  to  any  set  of  symmetrical  equations  it  is  usuallj 
most  useful  for  least-squares  analyses  when  D  is  a  diagonal  matrix.    When  this 
is  true,  the  procedure  for  computing  the  "neF"  coefficients  in  the  remaining 
equations  after  the  equations  for  a  set  of  classes  or  subclasses  are  absorbed 
is  given  by  S  »  N»D"3-N»    ^.th  D  as  a  diagonal  matrix  the  calculation  of 
S  -  N'D"^N  can  be  completed  m.cst  readily  with  the  aid  of  simplified  formulas. 
For  e::ample,  under  model  (5)  tne  absorption  of  the  equations  for  a  +  aj_  +  b^j 
is  accomplished  by  computing  the  following  "new"  coefficients  for  cj  ^  isLc)^}^^ 

C(ckCk)  =  n«,v  -  11 

^jk^ijk' 
^3  ^ij 


C(ckCk») 


3  ^ij. 


8/  This  indirect  pi-ocedure  was  firat  g^Lvsir  to  the  author  by  Dr.  G. 
Henderson  of  CoJ-nell  University^ 
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C[c^(ac)^^J  .  cr(ac).^(.c)^^J  -  'f^^ 
C(c^d)  »  D..     -  TZ^Ud^ 

C(c  g)  »  G..     »  EE^iJ^^ 
ij  '^ij. 


J        1  j  » 


C(d  d)  «  LZZZjf  „EE^ 
ijkl    ^^^1  ij'^ij, 

C(d  g)  -  2ZEE  D      ,G.     ,  -  ^ 


C(g  g)  »  ZSEE  Gf ,  »  EZ 


0*= 

ijkl  ^ij^^i      ij  n^^^ 

Under  the  general  model  (6)  the  "new"  right  hand  members  of  the  equations 
for  the        set  of  constants  are  obtained  from  Yg-N'D'^Yj^;  whereas,  under  the 

specific  model  (5)  the  "new"  RHM's  for  the  c^,  ^®^^ik'  ^         ^  equations  are 

n.    Y.  , 

S(c,  )  »  Y..,    -  ZE  ^1* 
k  k  n^j^ 

S(ac)„.  -  Y,  ,   -  E  ^iilsliii. 


J  ^j. 

°ii  ^ii 

S(b)  -  EZEE  D, ,y,     ,  -  EE  '* 


ijkl    ijkl^ijkl      ij  n^j. 

G  Y 

S(g)  =  EEE^^G       y         -  EE  J* 

ijkl    ij*^!  ^j*^l      ij  "ij. 

It  should  be  noted  that  ni j ,  appears  in  the  denominator  in  each  of  the 
N'D"^N  and  N'D"^Yi  terms  given  above  for  model  (5)  and  that  a  product  or  square 
appears  in  the  numerator  in  each  case.    This  simplifies  the  computations 
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required  in  the  absorption  of  a  set  of  class  or  subclass  effects, 

When  D  is  a  diagonal  matrix  the  raatrjjc  multipli.cation^  reqaii'ed  to  com- 
pute G  and  A  are  greatly  sinplified.    Hence,  the  matrix  inverse  to  the  com- 
plste  variance-covariance  matrix  can  often  be,  (Obtained  mth  a  ndniiwim  of  tloie 
ana  labor  even  with  a  large  number  of  least-squares  equations. 

Estimates  of  the  constants  included  in  the       set  under  model  (6)  may  be 
obtained  from  ^ 

B2  =  (S  »  N'D'^)"!    (Yg  -  N5D"-^Yj)« 

If  G  has  been  computed  these  e^ti^nates  may  also  be  obtained  from  Y^G  +  Yj^G  or 

G'Y^  +  CYge    When  the  estimates  jf  constants  included  in  B2  are  computed  ii 

both  ways  a  check  is  provided  on  the  comp-itation  of  Ge 

Estimates  of  the  constants  included  in  the  B^  set  may  be  obtained  by  soIt;*- 
ing  the  equations  for  i:m  constants  after  estimates  of  Uie  B2  ccustants  have 
been  computed^  i.e*^ 

If  the  G  and  A  segments  of  the  inverse  have  been  computed  these  constants  may 
also  be  obtained  from  ^JA  +  YJG«  or  AY^^  +  GY^,    These  different  methods  of  com- 
puting B^  provide  a  mea  is  of  checking  the  ca^ julations  involved  in  computing  G 
and  A« 

In  order  to  sjmplify  the  absorption  process  and  the  computation  of  A  and 
G  i«..  is  necessary,  under  model  (^),  to  place  the  constants  c,  s-  and  bj_j  to- 
gether-   Hence,  for  ohi^  model,  B-  consists  of  the  least-sqx^ares  subclass 
means,  the  a     a^  +  ^ii^    To  obtain  a  separation  of  these  constant  estimates 
one  can  impose  the  restrictions  that  2  aj_  =  5  Oj^j  ^  0  and  compute  averages  of 


the  a  +  aj_  +  bj^^,  i^e.^  2(a+aj^+bij) 


a  ■!•  aj_ 


qi 


ZZ(aH-ai+biO  a^) 
_  ^  i 

2qi  P 
i 


The  estimates  for  each  of  the  aj^  and  b^  can  then  be  computed, 

...  /V  A  ^ 

With  model  (5)  the  inverse  matrix  segments  A  and  G  apply  to  the  a+a^+bij 

rather  than  to  a,  the  aj_  and        separately.    In  many  cases  this  may  be  all 

that  is  required^    However,  the  transformatioxi  of  these  invgrse  segments  to 
obtain  the  inverse  elements  which  apply  dxrectly  to  a,  the       and  bij  is  easily 
accomplished.    The  general  method  of  transfomiiiT  sub-inverse  Jiatrices,  such  as 
A  and  G,  was  presented  in  previous  sections  and  mil  not  be  repeated  here  for 
this  special  casu. 


Goaputatlon  of  Sums  of  Squares 

Wheti  constants  are  fitted  for  all  degrees  of  freedom  among  a  set  of  sub- 
classes (two  factor,  three  factor,  «tc«)  except  the  highest  order  interaction, 
the  least-squares  sum  of  squares  for  that  interaction  can  be  computed  by  a  dif- 
ference, as  w&s  shown  for  the  two-way  classif icationo    Hazel  (1946)9./  g*ve  a 
method  of  computing  the  least-squares  sum  of  squares  for  a  two-factor  inter- 
action when  constants  were  fitted  for  two  main  effects  and  a  parti&l  regres- 
sion*    The  method  of  Hazel  requires  that  the  expected  subclass  means  be  com- 
puted from  the  constant  estimates  obtained  when  the  highest  order  Interaction 
is  assumed  to  be  non-existent.    The  sum  of  squares  for  interaction  is  then 
computed  by  multiplying  the  squared  difference  between  the  observed  subclass 
mean  and  the  expected  subclass  mean  by  the  number  of  observations  in  the  sub- 
I  class  and  accumulating  over  all  subclasses.    This  method  of  computing  the  sum 
'  of  squares  for  the  highest  order  interaction  when  constants  for  all  other  ef- 
fects among  the  smallest  subclasses  have  been  fitted  is  easily  extended  to  more 
coaiplex  analyses.    However^  it  should  be  pointed  out  that  no  method  is  avail- 
able for  obtaining  unbiased  tests  of  significance  for  two  or  more  interactions, 
unless  constants  are  fitted  for  all  interactions  except  one.    Even  then,  if  the 
test  of  significance  for  the  interaction  omitted  (usually  the  highest  order  in- 
teraction) is  significant  and  constants  were  not  fitted  simultaneously  for  this 
I  interaction  the  estimates  of  constants  for  all  other  effects  are  biased, 

j  The  appropriate  error  sum  of  squares,  for  use  in  testing  the  significance 

||  of  the  BC:A  interaction  omitted  in  the  least-squares  analysis  under  model  (5), 
]!  is  computed  from  ^2 

ijkl  ^^^^      ijk    iik  ijkl    ^^^^  ^^^^ 

"  ^'tez  <^ijkiyijki  f?g  ~^^~^> 

ijkl  ijk  ijk 

[where  d'  and  g'  are  the  partial  regressions  of  y  on  D  and  G  computed  on  a  with- 
in ABC  subclass  basis. 

The  sum  of  squares  for  BCtA  interaction  can  most  easily  be  computed 
directly  from 

R[«,a^^b^^,c^,(ac)^^,(bc)^^^,d,gl  -R[a,a^,b^^,c^,(ac)^^,d,g] 

where 

R[a.ai,blj.V(ac)^j^,(bc)^jj^,d,g]  =  ^'iT^  +  R(d',g'), 

rather  than  by  the  indirect  method  of  Hazel. 

The  error  sum  of  squares  when  the  BC:A  interaction  does  not  exist  is  com- 
puted from  2  of^       V  /    \      J  1 
^^iikl  ■  Pf«,ai,bi^,cj^,(ac)^j^,d,g] 

where  the  total  reduction  in  sum  of  squares  due  to  fitting  all  constants  is 
computed  by  summing  the  products  of  the  constant  estimates  and  the  correspond- 
fjing  RHM's  in  the  usxial  manner «,    The  mean  square  resulting  from  dividing  the 
9^/  Hazel,  L.  N,    The  covariance  analysis  of  multiple  classification  tables  with 
unequal  subclass  numbers^    Biometrics  Bulletin  2:21-25.  1946. 
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Slim  of  squares  obtained  in  this  way  by  ths  degress  of  freedom  for  error  ie 
biased  upwards  if  the  BC:A  interaction  exists.    Heace,  the  error  meaa  square 
used  by  Hazel  to  test  the  significance  of  his  two- factor  interaction  was  in- 
appropriate and  would  have  been  biased  upwards  if  the  interaction  had  existed. 

The  least-squares  (or  adjusted)  sums  of  squares  for  A,  B:A,        AC,  re- 
gression d  and  regression  g  may  be  coaputcd  by  the  general  direct  aiethod, 
B'Z-1b,  previously  described,  when  the  icverse  of  the  complete  variance- 
covariance  matrix  is  available.    The  adjusted  sums  of  squares  for  those  sources 
of  variation  which  have  only  a  few  degrees  of  freedom  can  almost  always  be 
computed  most  easily  by  this  direct  method.    In  avany  cases,  however,  when 
model  (5)  is  applicable  there  will  be  a  considerable  nusuber  of  degrees  of  free- 
dom for  B:A  as  compared  with  each  of  the  other  sources  of  variation.    In  this 
caae  it  may  be  spmewhat  easier  to  compute  the  sum  of  sqiaares  for  B:A  by  the 
indirect  procedure,  i.e.,  from 

R£a,a^,b,^  ,c^,(ac)^^,.  ,g3     -  Rfa,a^,c^, (ac)^.^,d,gj  . 

The  decision  concerning  which  procedure  to  use  in  each  case  is  largely 
determined  from  the  rank  of  the  matrix  inverse  S2gment(s)  that  must  be  invert- 
ed if  ths  direct  method  is  used    as  compared  with  the  number  of  equations 
which  must  be  solved  if  the  indirect  method  is  used.    For  example,  if  p  "  2, 
q^^  «  10,        ■  ^^f         r  «  3  the  indirect  method  should  definitely  be  used  to 

compute  the  stim  of  squares  for  B:A,  since  it  would  require  the  solution  of 
only  8  equations.    The  direct  method  would  require  the  inversion  of  one  9x9 
matrix  and  one  14x14  matrix  as  well  as  the  matrix  multiplications,  B'Z'J-B,  to 
obtain  the  same  sum  of  squares. 

Least-Squares  Means,  Standard  Errors  and  Individual  Comparisons 

When  all  effects  are  fixed,  except  for  the  errors,  the  investigator  is 
primarily  concerned  with  the  least-sqvsrcs  means  with  their  standard  errors 
and  tests  for  the  significance  or  contrasts  among  the  means.    The  least- 
squares  me-sns  are  constructed  from  th*.  constant  estimates,  e.g., under  model 
(5)       ^     ^     ^  ^     ^  , 

^  ^        U-a+dD  +  g5  -  Overall  mean. 

[i  +  ai  -  --  --  --  --  --  --  --    Means  of  the  A  classes. 

+  a^  +  b^j  -  --  --  --  --  --  -    Means  of  the  AS  subclasses. 

ji  +  cjj  -  -»-_------„-„».    Means  of  the  C  classes. 

^     \  ■'■^'^^^^ik"'"  "°    Means  of  the  AC  subclasses. 

In  each  case,  these  means  estimate  the  class  or  subclass  arithmetic  mean  that 
would  be  expected  if  a  new  set  of  data  was  taken  in  the  same  manner,  except 
that  equal  subclass  numbers  would  be  achieved  and  the  mean  of  the  Dij^l  and 
G"»jkl  for  aH  classes  and  subclasses  would  eqtjal  5  and  G.    When  unequal  fre- 
quencies and  variations  in  the  means  for  D  and  G  are  a  result  of  chance  rather 
than  a  result  of  the  effects  being  considersd^  these  means  (which  are  referred 
to  here  as  the  least-squares  means)  are  the  ones  desired.    Unequal  weighting  of 
the  individual  effects  in  obtaining  expected  means  msy  sometimes  be  desirable. 
For  example,  if  the  a^  represent  sire  effects  anr  the       represent  age  of  dam 
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effects  one  could  logically  be  interested  in  computing  the  expected  sire  means 
vhen  each  sire  is  mated  to  different  numbers  of  dams  of  the  different  age 
classes.    In  this  case,    ^  ^  _     ^  „  ^ 

|i»=a  +  dD+gG+Z  w^c^ 
k 

t'jhere  the  are  the  weights  to  be  given  each  age  of  dam  effect  based  on  the 
number  for  each  age  of  dam  class  that  is  to  be  considered  and  Z  Wj^  =  1,  The 

variance  of  jl'      a^  would  be  different  from  the  vairLance  of  [1  +  a^,  of  course, 

but  could  still  be  computed  from  the  inverse  elements  of  the  matrix  inverse  by- 
making  use  of  the  general  fonnula  for  computing  the  variance  of  a  sum. 

Standard  errors  for  least-squares  means,  or  for  differences  among  the 
constant  estimates  within  a  set,,  are  computed  from  the  inverse  elements  and  the 
estimate  of  of  in  the  usual  manner  that  has  previously  been  described, 

Duncan's  Multiple  Range  Test,  as  modified  by  Krame^i2/may  be  used  to  make  pair- 
wise  comparisons  or  tests  of  significance  for  orthogonal  comparisons  can  be 
completed.    Methods  for  completing  these  tests  in  least-squares  analyses  with 
fixed  effects  were  also  described  in  previous  sections. 


Estimation  of  Variance  Components 

When  all  effects  in  model  (5)  are  random,  except  a,  d  and  g  the  expec- 
tations of  the  mean  squares,  S(MS),  obtained  with  a  complete  least-squares 
analysis  are  as  follows? 


A 

d,f, 

p-1 

of  - 

S(MS) 

B:A 

s-p 

C 

r-1 

AC 

t-p-r+1 

Regression  d 

1 

Piegression  g 

1 

Error 

n,,.-s-t+p+3 

4 

These  expectations  point  out  that  exact  tests  of  aignificance  are  available  for 
each  of  the  two  regression  coefficients,  the  AC  interaction  and  the  B:A  effects. 
Since  k-j,  k^,  and  ky  are  all  different  coefficients,  with  unequal  subclass  num- 
bers as  are  also  k^  and  kg,  no  exact  test  of  significance  is  available  for  the 
effects  of  A  or  C,    If  the  interaction  of  A  and  C  is  non-existent  the  error 


10/  Kramer,  C.  Y.  1957.  Ibid. 


mean  square  is  the  appropriate  mean  square  for  testing  C  effects.  However, 
since  each  sum  of  squares  in  the  complete  least-squares  analysis  has  been 
adjusted  for  possible  effects  of  all  other  sources  of  variation  considered 
this  may  give  an  inefficient  test  depending  largely  on  the  amount  of  unequal 
frequencies  existing  in  the  data«    In  fact,  if  any  of  the  effects  included 
in  the  model  are  really  non-existent  all  tests  of  significance  and  estimates 
of  variance  components  are  less  precisely  determined  than  would  be  the  case 
if  such  effects  are  omitted  from  the  model  and  the  data  reanalyzed^    A  short™ 
cut  matrix  multiplication  procedure  for  use  in  deleting  sets  of  effects  is 
presented  in  the  next  major  sub»§ection  so  that  more  precise  re.^ults  can  be 
obtained. 

Since  the  effects  of  the  regressions  are  fixed  there  is  no  reason  for 
computing  kj^  and  All  other  k's  are  needed  in  order  to  obtain  the  es- 

timates of  the  variance  components.    The  coefficients  k^,  k^,        and  k^  may 

be  obtained  by  the  direct  method,  as  previously  described,  at  the  seaie  time 

the  sums  of  squares  are  computedo    However,  if  the  number  of  B  classes  in 

the  different  A  classes  is  large  relative  to  the  number  of  A  and  C  classes 

it  will  be  best  to  compute  k,  and  the  sum  of  squares  for  B:A  by  the  indirect 

o 

methode    The  remaining  coefficients  k^,  k^  and  kg  can  only  be  obtained  by  the 

indirect  method  at  present  if  any  of  the  n,  ,       0»    If  all  n.  ,.2>0,  k.  ■ 

—  k„  and  k_  »>  °"  k^»    Evea  though  all  n.  ,  ^0,  kj,  must  still  be  computed  by 
p     J  /       T     y  1»K  o 

the  indirect  method » 

Mixed  Model 

In  many  problems  the  a^  and  h^^  effects  are  random  and  the  c^^  effects, 

as  well  as  the  regressions,  are  fixed.    For  this  case,  Hendersonii./  has 
presented  a  short-cut  method  (  referred  to  by  him  as  Method  2)  of  computing 
unbiased  estimates  of  the  variance  components  when  p  and  the  q^  are  large. 

The  method  is  also  applicable  when  all  effects  are  randcaa. 

Method  2  of  Henderson  and  the  complete  least-squares  analysis  give 
unbiassed  estimates  but  not  "precisely  the  same  estimates"  of  the  variance 
components . 


Using  Method  2  of  Henderson,  the  analysis  of  a  set  of  data  under  model 
(5)  when  the  a^,  the  b^^  and  ^^^^^         rendom  and  all  other  effects  are 

fixed,  ma>  be  coispleted  with  the  following  steps: 


1_1/  Henderson,  C,  R.  1953,  Ibidc 
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step  Ic     Absorb  the  equations  for  a  +  aj_  +  b^j  into  the  equations  for 

the  ci^g  the  (ac)jj^,  d  and  g  In  the  manner  described  above.  In 

general,  raatrix  notation  for  ary  analysis  this  step  is  accomplished 
by  canip-'iting  the  S  ~  N»D*'iN  and  the  Y2  -  N'O'^-y^^  values. 

Step  2,      Impose  the  restrictions  (or  constraints)  on  the  constant  esti~ 
mates  ?»o  that  Z  Cj^  -  2(ac)jj^  "  S(ac)j_j^-  ~  0  and  complete  the 

k    "     i  k 
appropriate  subtractions  and  additions  by  columns  and  rous  among  the 
"nev:*'  coefficients  and  "new"  right  hand  members. 

Step  3»  .    Invert  the  reduced  coefficient  matrix  for  the  cj^,  (ac)ii^,  d  and 
g  and  compute  the  estimates  for  these  constants  eD+-her  from  a  direct 
solution  of  th^  equations  or  from  C"^  ^2°N'^""^l)>  C"^  is  the 

matrix  inverse  aad  (Y2°N'D-1Yi)  refers  to  the  right  hand  members » 

Step  U,      Compute  the  least-squares  sums  of  squares  for  C,  AC,  Regression 
dj  Regression  g  and  Error,    The  error  sum  of  squares  is  computed 
from 

Error  S.  Sqs.  =  ZZZZy^\i  '  ^  R*ai/Dij,ck,  (ac)^^^^,^^ 

ijkl 

Y?  . 

=  Z2Z2y.^^j^3_  -11^'  B«(Y2  -  K^D"1Yi) 
ijkl  ^         ij    "-J*  2 

vhere         is  a  row  vector  of  the  cj^> (ab) d  and  g  and  Y2  N'D''%2 

is  a  col^iBTin  vector  of  the  corresponding  RHM's  in  the  reduced  equa- 
tions « 

In  most  cases,  the  least-squares  sums  of  squares  for  C,  AC  and 
the  two  regressions  can  most  easily  be  computed  by  the  general 
direct  method^  B'Z"1b,  as  previously  described. 

Step  5>     Set  up  the  analysis  of  variance  for  the  fixed  effects  and  com'- 
pute  the  F  values  for  test  of  significance  as  indicated  below: 

Source  of  Variation 


d.f. 

E(MS) 

r-1 

of  + 

kiiofc  +  k^CT§ 

t-p-r+l 

oi  - 

k30ac 

1 

k2o| 

1 

o^  + 
e 

^l1 

,-s-t+p+3 

c 

AC 

Regression  d 
Regression  g 
Error 

If  the  interaction  of  A  and  C  is  likely  to  be  non-significant  and 
smsll,  one  should  first  compute  the  sums  of  squares  for  error  and 
the  AC  interaction.    When  the  F  test  clearly  indicates  that  the 
(ac)jj^  effects  are  near  zero,  the  (ac)ij^  effects  can  be  deleted 
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before  completing  the  test  of  significance  for  the  fixed  effects,  A 
short-cut  procedure  for  use  in  deleting  a  set  of  effects  is  given  in 
the  next  sub-sect_on. 

No  exact  test  of  significance  exists  for  the       fixed  effects 
vhen  the  aj_  effects  are  random  and  the  (ac)j_j^  dc  not  equal  zero. 
However,  the  c^  are  iinbiasea  estimates  of  the  effects  of  the  G 
classes  and  may  therefore  be  ased  to  adjust  either  these  data  or 
other  data  of  a  similar  nature.    If  the  AC  interaction  is  found  to 
be  non-significant  and  small  and  is  then  deleted  irom  the  analysis, 
an  exact  test  of  significance  can  be  made  for  the  cy.  effects »  In 
this  case,  mean  separation  procedures  for  pairwise  comparisons  or 
tests  of  significance  for  orthogonal  contrasts  among  the  c^  may  be 
completed  too  as  has  previously  been  outlined^ 

Step  6.     Compute  the  adjusted  totals  (Y!^'^  ^  for  the  AB  sub-classes  as 

follows  t 

^±3.     ^ij-  ■  i'^Ok       -  ^±3k^^^hk  -  ^  -  S  Gij. 

The  general  fc-Taula  in  matrix  notation  for  computing  the  adjusted 
totals  for  the  classes  or  subclasses  absorbed  is  Y^'  *       "  ^2' 

The  adjusted  totaLs  for  the  A  classes  are  then  computed  by  summing 
the  YI',    over  j,  i^e., 

Y»  >    =  Z  YJ » 

i..  j 

Step  7»     Compute  the  "adjusted"  dums  of  squares  for  A  and  B:A  ana  the 

e(ms). 

The  adjusted  suns  of  squares  are  computed  as  follows s 
Adjusted  S,  Sqs,  -  -  A«  =2  -  fnff 


i  ''i. 


2  2 

V!i  Y!« 

"  1.1.  1< 


Adjusted  S,  Sqs,  -  -  B« :A'  »  Zl  -  2  ^ 

ii  i 


These  adjusted  sums  of  squares  for  A  and  BsA  will  not  be  the  same 
as  the  least-squares  sums  o^  squares  since  this  procedure  in 
effect,  assuaies  that  the  adjustments  for  the  effects  of  Cj,,  (ac)xic^i, 
d  and  g  are  made  without  error.    Since  the  c^,  (ac)^,  d  and  g  are' 
measured  with  error,  the  expectations  of  the  mean  squares  for  A» 

and  B» :A'  are      ,  . 

d.i  i 

V  P-1  ^12^  "  ^13^sa  ^lli^ 

B' :A'      s-p  ^lO^e 
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The  unusual  featiur's  of  these  expsctations  of  mean  squares  is  the 
coefficient  for  or:  In  both  the  1'  and  the  B«  !fA»  mean  squares.  Usu- 
ally this  coefficient  is  milty  in  each  case^^  but  since  k-j_Q  and 

are  not  unity  it  is  "ec-'csc.."-  +o  compute  them  in  order  to  obtain  un- 
biased estimates  of      ;    a."     i/ ^    In  addition^f  it  is  necessary  to 

coirpute  kj^j  k-^^-:^,  aiid  k^|^„    These  last  three  coefficients  are  com- 
puted oy  the  standard  procedure  for  computing  coefficients  of  vari- 
ance components  with  unequal  frequencies  in  the  nested  classifi- 
cation i.e«5 


The  calculations  required  in  the  computation  of  k.^^  and  k^  are 
somevhat  more  involved  than  in  the  computation  of  k-jj_^  k^^  and  k^j^. 
However^  at  tliis  stagej,  inany  of  the  values  required  to  compute  k^^ 
and  k-|^2  hax'-e  been  computed  during  the  absorption  of  the 

a  +  aj_  +  b^j  equations  and  the  inversion  of  the  reduced  matr^jc. 


The  C-'-J  are  the  inverse  elements  in  the  iriatrix  inverse  to  the 
reduced  least  squares  coefficient  matrix  after  absorption.    The  P!^^ 

elements  are  the  values  obtained  in  the  absorption  of  the  a+a^+b^^ 
equations,  i»e.,  the  N'D'^-^N  values^  the  P!  .  are  the  values  obtained 

from  the  M'D  -^N  multiplication  if  the  b^^  effects  are  deleted  from 
the  model  and  only  the  a  +  aj_  equations  loere  to  be  absorbed j  and  the 
Pj[j  are  the  values  obtained  from  the  N«D"%  multiplication  if  both 
the  a^  and  b^j  effects  are  deleted  from  the  model  and  only  the  a 
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equation  is  absorbed*    For  exainple, 


2 


"•1  "ill 
P     =    pit  =  ZZ  ^ 


ijl  ij2 


P     a   Tni. .....ri.  p!  I    «  72   

^12        n,.,  nj,^ 


etc. 


P' 

11 


i 


etc, 


12 


'i.l''i.2 


etc. 


The  C  and  P  matrices  must  all  be  of  the  same  siae.    In  this 
case,  the  order  of  these  matrices  may  all  be  either  (r+t+2)  X 
(r+t+2)  or  (t-p+2)  X  (t-p+2),  depending  on  whether  the  inverse 
3lements  for  the  columns  and  rows  deleted  as  a  result  of  imposing 
restrictions  are  computed  and  added  back  into  C,  or  wtether  the 
same  subtractions  and  additions  are  completed  in  the  P  matrices  as 
was  done  in  the  S-N'D'^N  matrix*    Both  procedures  will  yield  the 
same  results.    In  fact,  these  subtractions  and  additions  by  columns 
and  rows  in  the  P  matrices  may  be  compieted  in  the  Pj^^  -  PI^ .  and 

the  P!  .  -  P  .  matrices,  if  desired. 


ElLnination  or  Deletion  of  a  Set  of  Effects 

If  the  test  of  significance  for  the  AG  interaction  described  above  indi- 
cates that  the  (ac)jLi,  effects  probably  equal  sero  the  inverse  elements  associ- 

ated  with  the  Cj^,  d  and  g  mav  be  adjusted  by  a  deletion  procedure.  The 

adjusted  inverse  elements  are  the  inverse  elements  that  wculd  be  obtained  if 
all  equations  for  the  (ac)^]^  are  deleted  in  the  reduced  matrix  and  the  matrix 

of  coefficients  for  the  cj^-,  d  and  g  was  inverted  in  the  usual  manner.  The 

adjusted  inverse  elements  are  comDuted  with  the  following  matrix  arithmetics 

GT-*-  -  C.'^-  -  RZ7J  R» 
rt  AG 

where  G^-*-  is  the  matzrLx  of  inverse  elements  associated  with  the  Cj^,  d  and  g  in 
the  C"^  matrix^  Z~l  is  the  jjiverse  of  the  square  segirient  of  C"^  corresponding 
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to  the  (ac)ij^,  and  R  is  the  matrix  of  inverse  blements  T^ich  associate  the 
(ac)jj^  with  the  C|^>  d  and  g. 

Since  Z^^  is  used  to  compute  the  least-squares  sum  of  squares  for  AC^  the 

additional  labor  required  to  obtain  the  new  inverse  matrix  is  often  small  in- 
deed as  compared  vjith  the 'direct  inversion  of  the  matrix  of  coefficients  for 
the  ck,  d  and  g.    With  other  models  the  difference  in  time  required  to  compute 
the  reduced  inverse  matrix  is  likely  to  be  considerably  greater  than  with  the 
particular  model  under  discussion. 

The  estimates  of  the  cj^^  d  and  g  unadjusted  for  the  unequal  frequencies 

associated  with  the  AC  interaction^  may  be  obtained  by  multiplying  the  new  in- 
verse  elements  by  the  appropriate  RHM's®    Sums  of  squares  for  C,  Regression  d 
and  Regression  g  are  computed  by  the  direct  method  from  the  B»Z"1b  multipli- 
cation using  the  new  constant  estimates  and  the  new  inverse  elements.  Also, 
the  sum  of  squares  for  error  must  be  recomputed,  since  the  degrees  of  freedom 
used  for  the  AC  interaction  must  now  be  placed  into  error. 

Numerical  Example 

The  average  daily  gain  (aDG)  for  each  of  bS  steers  of  the  Hereford  breed 
arranged  according  to  line,  sire  and  age  of  dam  is  given  in  the  table  below. 
The  age  at  weaning  and  initial  weight  at  the  beginning  of  the  test  feeding 
period  is  also  given*    All  of  these  stsers  'iere  fed  for  the  same  length  of  time 
in  the  feed  lot.    Identification  numbe,  £-  have  been  assigned  for  line,  sire  and 
steer  for  covenience.    In  addition,  the  original  data  have  been  rearranged,  for 
illustrative  purposes. 

Line  No.      Sire  No, 
1  1 


Age  of 

Steer 

Initial 

Average 

_  Dam 

No. 

V7eight 

Daily  Gain 

3 

1 

192 

390 

2.2U 

2 

15U 

U03 

2.65 

h 

3 

135 

li32 

2.U 

U 

183 

U57 

2c25 

186 

U83 

2.58 

6 

177 

i;69 

2.67 

7 

177 

h28 

2.11 

8 

163 

U39 

2.1i7 

h 

9 

188 

b39 

2.29 

10 

178 

ii07 

2.26 

5-up 

11 

198 

ii98 

1.97 

12 

193 

U59 

2.IU 

13 

186 

li59 

2M 

111 

175 

375 

2.52 

15 

171 

382 

1.72 

16 

168 

I4I7 

2.75 

3 

17 

l5i^ 

389 

2.38 

k 

18 

I8ii 

hXk 

2.146 

5-up 

19 

llh 

h83 

2.29 

20 

170 

1^30 

2.30 
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Line  NOa>     Sire  No« 
2  k 


5 


3  6 


8 


Age  of 

steer 

Initial 

A.verage 

Dam 

No. 

Weight 

Daily  Gain 

1^53  ■ 

2. .91.1 

3 

22 

158 
158 

381 

2.50 

23 

365 

2.)4i 
2.ii.'4 

k 

2h 

169 

386 

25 

lUU 

339 

2.15 

26 

X59 

U19 

2.5ii 

27 

152 

i;69 

2*7lt 

28 

1U9 

376 

2,50 

29 

lij9 

375 

^.5U 

3 

30 

189 

395 

2.65 

31 

187 

Ui7 

2.52 

32 

165 

h30 

2.67 

5-up 

33 

181 

163 

2.79 

3k 

177 

385 

2.33 

35 

151 

iOl; 

2.67 

36 

lli7 

353 

2*69 

h 

37 

18U 

1+11 

3.00 

38 

18U 

ii2C 

2.U9 

39 

187 

h27 

2.25 

UO 

I31i 

h09 

2.U9 

183 

337 

2.02 

U2 

177 

352 

2.31 

3 

li3 

205 

U72 

2.57 

kh 

193 

3U0 

2.37 

hS 

162 

375 

2.6U 

5-up 

US 

206 

U5l 

2.37 

hi 

205 

1+72 

2.22 

1+8 

137 

U02 

1.90 

k9 

178 

hSh 

2.61 

50 

175 

hlh 

2.13 

3 

51 

200 

U66 

2.16 

52 

18U 

356 

2.33 

53 

175 

hh9 

2.52 

U 

>h 

178 

360 

2.1;5 

5-up 

55 

189 

385 

iM 

56 

iBli 

a3i 

1.72 

57 

183 

UOI 

2.17 

3 

58 

166 

hoh 

2.68 

If 

59 

187 

li82 

2.1t3 

60 

186 

350 

2.36 

61 

I81i 

hQ3 

2.hh 

5-up 

62 

180 

i»25 

2,66 

63 

177 

1|20 

2.ii6 

61^ 

175 

hk9 

2.52 

65 

I6ii 

U05 

2M 

Th8  mathematical  model  underlying  the  analysis  to  be  made  vlth  these  data  is 
yijkl  =  a  +  gi  +  Sij  +  aj^  +  (ga)i]c     bAijki  +  d^i^i^l  +  ©ijkl 


i  =  1»..2,  3  k  «  1,  2,  3 

j  «  1,  2,  3,  ii  1  =  1,  2,  6 

where:  ^ijkl  average  daily  ga5ji  for  tiie  l''^'^  steer  in  the  k"^^  age 

of  darn  class  by  the  j'^^-'  sire  of  the  l'^-'^  linej, 

a  -  the  theoretical  population  mean  mth  equal  subclass 
frequencies  when  weaning  age  and  initial  weight  both 
are  equal  to  the  absurd  value  of  zeroo    The  population 
mean  with  equal  frequencies  vxhen  the  weaning  age  and 
initial  weight  both  are  equal  to  the  average  is 

=  a  +  bA  +  dW, 


g±  «  effect  of  the  i"^^  breeding  gi^oup  or  line^ 

s^j  =  effect  of  the  j'^^^  sire  in  the  i''^^^  line, 

aj^.  =  effect  of  the  k"^^^  age  of  dam  class, 

(ga)^j^  =  interaction  efic.ts  for  line  and  age  of  dam, 

b  =  partial  regression  of  ADG  on  age  at  weaning.    Since  all 
steers  were  placed  on  test  at  the  same  time  this 
regression  coefficient  will  measure  the  effect  of  time 
of  birth  on  ADG  in  the  feed  lot, 

•^ijkl  °  ^S®  at  weaning  for  a  given  steer, 

d  =  partial  regression  of  ADG  on  initial  weight, 

%jkl  "  Initial  weight  for  a  given  steer, 

®ijkl  ~  random  errors. 

Complete  Least-Squares  Analysis 

The  least-squares  equations  are  presented  in  Table  1,  The  reduced  set  of 
least-squares  equations,  resulting  from  imposing  the  restrictions  th^t  2gj_  =» 

i 

2Sj_^  =  Zai,  =  Z(ga)4j.  =  Z(ga)^i^  =  0  and  after  completing  the  appropriate  sub- 
j  k        i  k 

tractions  and  additions  by  columns  and  rows,  is  presented  in  Table  2,  The 
matrix  inverse  to  the  reduced  least-squares  matrix  is  given  in  Table  3« 
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Estimates  of  the  constats  obtained  from  the  solution  of  the  reduced  set  of 
equations  are  as  follows :12/ 


a  " 

3 • 06106 

^1  ° 

Si  " 

-,0803< 

-«006ll 

A 

gj  - 

e03?30 

-•11005 

A 

.09051 

(ga)3_2  = 

,01311i 

A 

^12  " 

-»10750 

(ga)2T|^  « 

.02522 

^21  " 

-«11P17 

(ga)22  ° 

-.08066 

'331  « 

.I658ii 

b  » 

-.OO815I4 

^32  ^ 

,0kO9B 

A 

d  " 

•001969 

S33  ^- 

-•26909 

Sstimates  for  the  overall  meanj,  y.,  and  the  constants  omitted  by  the  subtractions 
are  computed  as  follows: 

|1  ■  a  +  bl  +  dW 

»  3.06106  -  («00615U)  (176^65)  +  (.001969)  (U6.S5) 

-  2^liiilii3 

g3  -  -  (-.08035  +  .03730)  =  ,OU305 

S13  «  -  (^09051  -  .10750)  =  .01699 

S22  =  -  (-ell917)  -  -lOT? 

^3h  '  '  (-l^^Q^  *  *OU098  -  ,26909)  »  .06227 
33  «  -  (.069ii6  -  ,00611)  -  -.06335 
(ga)i3  »  -  (-.11005  +  «0133ii)  =  .09691 
(ga)23  =  -  (.02522  -  ,08C66)  =  .0$Skk 
(i^)yi  "  -  (-.11005  +  «C2522)  =  .08183 
(£1)32  "  -  (.0131ii  -  .06066)  »  .06752 
(ga)-«  -  -  (•081483  +  «06752)  -  -^15235 


12/  Several  extra  decimal  digits  are  given  so  that  sums  of  squares 
computed  later  will  be  more  accurate. 


-107- 


Analysis  of  Variance 

A  test  of  significance  for  the  possible  interaction  of  sires  with  age  of 
dam  (SA;G)  will  first  be  completedo  The  least-squares  equations  for  b'  and  d' 
on  a  within  line  X  sire  X  age  of  dam  subclass  basis  are  as  frllows: 

^,830o3&       +  8,046.82  d'  »-i9«5927 

8,046,82  P  +  68,016.78  d'  -  73.5537 

The  estimates  of  h'  and  d'  are  obtained  bj  solving  these  two  equationse 

b'  -  -.007295 

d'  «  .001945 

The  error  sum  of  squares  for  testing  the  significance  of  the  SA:G  interaction 

^^^lik  -  -  R(b',d') 

ijkl    ^^"^     ijk  ^i^^ 

=  2.2673  -  (-.007295)(-19.5927)  -  ( .001945)(73e5537) 
»  2.2673  -  .2860  »  1,9813 

The  GSA  observed  and  expected  subclass  means  are  given  in  Table  4.    The  ex- 
pected means  are  computed  by  first  adjusting  the  subclass  total  (for  Y)  with 
the  constant  estimates.    Fcr  example,  the  expected  subclass  mean  for  line  1, 
sire  1  and  age  of  dam  3  is 

•|-[4.89  +  .08035  -  .09051  -  .06946  +  .11005  +( .008.54) (173.00  -  176.65) 
-  (o001969)(396.50  -  416,85)1 
«  2.4654 

The  sum  of  squares  for  the  SA:G  interaction  is  computed  ft  m  Table  4  as 
follows: 

S.  Sqs.  -  »  -  SA:G  -  (2)(-.0204)^  +  (2)^.0059)^  +  +  +  (4)r-o0183)^ 
-  .4890 

This  sum  of  squares  may  be  more  easily  computed  from  380.6325  +  .2860 
-380.457A  =  .4611.  Rounding  errors  probably  account  for  the  apparent 
descrepancy  in  the  two  methods. 

The  test  of  significance  for  the  SA:G  interaction  may  now  be  completed  as 
follows: 

Source  of  Variation    d«f .    S.Sc8»  M.S. 

SA:G  10        .^611     .04611  n.s. 

Error  .'=^8  »05214 
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Table       Obsarved  and  expected  GSA  subclass  means 


Line    Sire  Age  of  Means 


No, 

No. 

Dam 

Observed 

ExDected 

Obs«-SxE 

1 

1 

3 

2 

2.Zi654 

-.0204 

1 

1 

U 

2 

2*3300 

2,32Ul 

•0059 

1 

1 

r' 

h 

2.6075 

2 .5761 

.03m 

1 

2 

ii 

2 

2.2750 

2*3852 

",1102 

1 

2 

5 

6 

2.2567 

2,2846 

-.0279 

1 

3 

3 

1 

2 e 3800 

2.354I 

.0259 

* 

3 

1 

2,li600 

2,5819 

-.1219 

1 

3 

5 

3 

2 .5100 

2 .4015 

.0205 

2 

] 

4 

3 

2 

2.U700 

2  ♦1+307 

9O393 

2 

!. 

1. 

4 

c 

2,2950 

2 • 3507 

-.055/ 

2 

1 

4 

1 

4 

2.5800 

2.5562 

.02  38 

2 

5 

3 

1 

2 ,6500 

2.5426 

.107U 

2 

5 

t 

4 

2 

2.5950 

2.5362 

,0500 

2 

5 

5 

h 

2.6200 

^.5130 

.1062 

3 

o 

1 

4 

2 

2,7U50 

2,0411 

.1039 

3 

O 

5 

4 

2,^675 

2,2992 

-,0317 

3 

•7 

i 

3 

2 

2,I|.700 

m  "71. 

3 

7 

! 

4 

2961+00 

2.U575 

3 

7 

5 

2,2h60 

2, 2851 

-,0391 

3 

o 

o 

3 

3 

2  e  3307 

2*3031 

°,04O4 

3 

8 

4 

1 

2,U5oo 

2.7376 

-,2876 

3 

8 

3 

1.7767 

1.95ii8 

-.1781 

3 

9 

3 

1 

2 e 6800 

2.3589 

.3211 

3 

9 

h 

3 

2 .UlOO 

2,36U8 

.0U52 

3 

9 

5 

2.5150 

2.5333 

-.0183 

The  ratio  for  F  ( *'^^^^^)  ig  ^333  than  one,  indicating  clearly  that  there  is  no 
.0521ii 

interaction  of  sire  by  age  of  dam  within  lines.    The  analysis  of  variance  can 
now  be  set  up  to  test  the  significance  of  the  effects  for  which  constants  have 
been  fitted. 

The  sums  of  squares  are  computed  from  the  sub-inverse  matrices  and  the 
constant  estimates  by  the  general  direct  method,  B'ZB,  as  follows: 


S.Sqs,— G  =   1^,08035  .0373(3^' 

=  t.08035   .037 30J 

[lli. 


01+6831  -.0291+01 
0291+01  .062328 


=  tl.90385  -.29963] 


33781+  II+.31079 
31079  22,79U76 
-.08035 


►li+18 


[-.08035 
[.03730 


S.Sqs.~S;G  »    ^0905l  -.lOT^o] 


098039  -•0U1738]"3- 


OU1738     e 109726 


^1* 


(-.11917)' 
.079911 


13li969  -.Oiil75ii  ».0^2016 


tl6581i    .0it098    -',2690§^-';:o6OTr    .10l;$60  -.028235 
,052016  -.028235  .1200liU 


-1 


S.Sqs." 


=  .13528  +  .17772  +  .62U78 
=  .9378 

06111;  -.03ii39U 


-ii  =  f 069U6  -.00611] 
»  .12U7 


-.03U39ii  .OL.512U 


S.Sqs.~GA  =[-.11005    .01311;    .02522  -.08066] 


.132501  -.079153  -.080915  .0U5079 

-.079153  .093828  .Oii8381i  -.05655U 

.08091:'  .0U838i4  .126387  -.07li6l;3 

.oit5o?9  -.o5655ii  -ToTUSJlJ  .098938 


GA 


S.Sqs.— b 


S.Sqs, 
S.Sqs < 


.li65l 

iz::22^  .  .31.82 
.00019093 

(.001969)^ 


.00001503 


.2579 


—Error  »p  Zl^ZZ  y±^]^±  -  R(all  constants) 

=  382,8998  -  (3.O6IO6)  (156.7^  -  (-.08035) (^17.69)  -  -  - 

-(.001969)(65,U28.M4) 
=  382.8998  -  380.3931  ■ 
^  2.5067 


The  analysis  of  variance  table  may  now  be  set  up  as  follows: 


Sc'ijiTc-e  of  Variation 

S.Sqs, 

F 

Lines  \Q) 

<1 

Sires  (SjG) 

6 

.9530 

.1588 

2„99^ 

Ages  of  Dam  (A) 

2 

.062li 

1«20 

GA 

h 

.1^651 

•  1163 

2»23 

Regression  b 

1 

.3I482 

.3U82 

6,67* 

Regression  d 

1 

.2579 

ii.9li^ 

Error 

li.8 

2«5067 

•0522 

The  tests  of  significa-ice  assume  that  the  effects  of  sires  and  error  are  random 
and  that  all  other  effects  are  fixed.    The  lack  of  significance  for  the  GA 
interaction  suggests  that  more  accurate  estimates  of  all  other  effects  could  be 
obtained  by  deleting  the  interaction.    The  short-cut  deletion  procedure  will  be 
illiistrated  in  the  next  section  rath  crossline  data.    The  remaining  analysis 
with  this  example  will  assume  that  the  interaction  effects^  (6^)ik^'  Qxist  and 
must  therefore  be  retained  in  the  analysis. 


Variance  Component  Estimates 

If  the  classifications  of  G,  SiG  and  A  are  regarded  as  random  the  investi- 
gator is  interested  in  estimating  Og,  o|jg,  o^  and  Og^,    To  compute  estimates 

of  these  variance  components  from  the  complete  least-squares  analysis  it  is 
first  necessary  to  compute  the  coefficients  for  them  in  the  expectations  of  the 
mean  squares.    Since  all  of  the  subclasses  of  G  and  A  are  fitted  and  the  number 
of  degrees  of  freedom  for       SsG,  A  and  GA  are  small  it  is  fairly  easy  to  com- 
pute estimates  of  of.„.  of  and  a*_.    The  expectations  of  the  mean  squares  for 
o .  g      (A  ga 

G,  S:G,  A,  GA  and  errors  are  as  follows: 

E(MS) 

k^a| 


G 

<i  * 

■^7  °|a  * 

1^8 

<4: 

S:G 

^6  "iig 

A 

GA 

Error 

o2 

The  coefficients  k'^,  k^,  k^,  and  k^  may  easily  be  computed  by  the  direct 
method  for  this  particular  analysis »    For  example, 

k^  =      I30.3378I;  +  22,79U76  -  «(lii.31079  +  Ui.31079)] 


=  i  (38.82181)  =  12.9U. 
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The  values  for  k-^,       and  k^,  when  >:;oraputed  in  a  similar  manner  from  *h8 
inverse  matrices  used  to  compute  the  sujbi  of  squares  directly,  are 

k3  =  6«967 
k^  =  l$.k3Q 

Since  all  GA  subclasses  are  filled,  the  coefficients  for  cr|^,  k^^  and  ky,  are 
computed  as  follows: 

%  =  jk^  =  ^,hl9 
ky  -  ^k^  =  ii.311i 

The  only  me+hod  available  at  present  to  compute  kg  is  the  indirect  method  of 

Henderson,  which  requires  the  inverse  of  the-  13'xl5  least  "Squares  coefficient 
matrix  when  the  g,.  are  omitted  from  the  model  as  well  as  the  associated  sums 

matrix  of  the  same  order.  Since  the  indirect  tiiethod  was  illustrated  with  the 
numerical  example  of  the  last  section,  kg  will  not  be  computed  for  this 

analysis.    The  estimates  of  <y^Q_^  of  and  of.g  are  ^'omputed  as  follows: 


SsG 

d.f« 
6 

Mean  Squares 
.1588 

4- 

E(MS) 
6.242  o|.g 

A 

2 

.0623 

of  - 

GA 

k 

.1163 

6,967 

Error 

18 

.0509 

of 

+  l5.i?38  o| 


o|  =  ,0522  a|     -  -.0026 

a|^=  .0092  a|      =  .0170 


Method  2  of  Henderson 

Frequently  with  animal  breeding  data  such  as  these,  the       and  s^^  will  be 

genetir  classifications  for  which  variance  componert  estimates  are  desired. 
All  other  effects  in  the  model,  except  the  errors,  will  be  fixed.    V.Jhen  tht> 
numbers  of  degrees  of  freedom  fo.  the  genetic  clasf 'fication?  are  adequate  the 
complete  least-squares  analysis  will  usually  be  irap'^actical.    The  modified 
least"squares  analysis  developed  oy  Henderson,  which  proiddes  exact  least-  [ 
squares  tests  of  significance  for  fixed  efXocts  when  no  interactions  ex3.st 


among  fixed  and  random  effects  and  unbiased  estimates  of  the  variance  com- 
ponents, Ti«7ill  now  be  illustrated  mth  the  present  set  of  data. 

Step  1>    Absorption  of  the  a-s-g3^+Sj_j  equations. 

The  N«D"%  Talues  are  presented  in  Table  5  and  the  S-N»D"%  values  (the 
reduced  matrix)  are  presented  in  Table  6,    These  values  were  computed  from  the 
original  least-squares  equations  in  the  same  manner  as  was  illustrated  with  the 
numerical  example  of  the  last  section. 

Step  2<,    Imposing  the  restrictions  that  ^  a^^  =  ^(6^)ik~  ^(S3')ik  " 

The  reduced  set  of  equations  for  the  a.^^  (ga)jj^,  b  and  d  after  completing 
the  appropriate  subtractions  by  columns  and  rows  is  presented  in  Table  7« 

Step  3*    Inversion  of  reduced  matrix  and  calculation  of  constant  estimates. 

The  inverse  of  the  reduced  matrix  for  the  aj^,  (ga)j^j^,  b  and  d,  (C),  is  the 

same  as  given  for  this  segjnent  in  Table  3»    The  estimates  of  these  constants, 
obtained  from  solving  the  reduced  set  of  equations,  are  identical  with  those 
obtained  vrhen  the  complete  matrix  was  solved. 

Step  U,    Least-squares  sums  of  squares «, 

The  sums  of  squares  for  A,  GA^,  Regression  b  and  Regression  d  are  the  same 
as  given  in  the  complete  least-squares  analysis. 

Step  5«    Analysis  of  variajice  for  fjjced  effects. 

The  analysis  of  variance  for  these  effects  is  the  same  as  was  given  for  A, 

GA,  Regression  b.  Regression  d  and  error  in  the  complete  least-squares  analysis. 

When  the  complete  least-squares  analysis  is  not  done  the  error  sum  of  squares 
is  computed  from  ^ 

If  ^ijk  - 1]  if.  -  H  (^2-N'i>-'^i) 

as  was  indicated  previously. 

Since  the  gj_  and  b^^  are  now  assumed  to  be  random  no  exact  test  of  signifi- 
cance is  available  for  the  aj^  effects. 

Step  6,    Computation  of  the  adjusted  totals  for  the  GA  subclasses  and  the  G 
classes. 

The  adjusted  totals  for  the  line  X  sire  subclasses  are  computed  as  follows: 
=  19.98  -  (2)(,069U6)  -  (2)(-,006ll)  -  (U)(-.C6335) 

»(2)(-,1100^)  -  (2)(«0131U)  »  (l4)(.0969l)  -  (-,008l5ii)(li|17) 
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CD  <M 
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vO 

CM 
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■vO 

so 

lA 

iH 
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On 
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CO 

IH 

!>- 

O 

rH 

H 

CM 
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r 

OC' 

-3 

iH 

«\ 

! 

CM 
1 

CVI 

8 

rH 
! 

1  S 

O 
fH 

OA  OA 

OO  vO 
O  CO 


2 


CM 

CM 

H 

H 

1    CM  CM 

/-^ 

cd 

(« 

1  ^  ^ 

w 
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-(.001969)(3501) 

Y"    -    18,09-  (2)(-*C06ll)-(6)(-.,06335)-(2)(,01311i)-(6)(,09691) 
12, 

-(-^OOSl^i^)  (IhSlhi  •001969)  (3li36) 
-  22«9895 

Y««    -=111.9909  I' 5    -  19.6223  I'»  =25.331+8 

13.  31.  3k 

Y» »    -  23«8369  Yl  '    =  25.l6h3 

21,  j2» 

Y««       22.5'2&3  Y"    =  19.9128 

22  33. 

The  adjusted  totals  for  lines  are 

Y««  =  2i;.5736  +  22.9895  +  lii,9909  =  62.551iO 
1.0 

Y"  =  146.3629 
2». 

Y'  •  =  90.03li2 
3,« 

Step  7»    Computation  of  sums  of  squares  and  variance  component  estimates. 

The  adjusted  sumvS  of  squares  for  line  (G» )  and  sires  within  lines  (S'sG') 
may  now  be  computed  as  follows: 

S    Sqs  -G«  =  (62.55iiO)^  ^  (i|6.3629)=^  ^  (90.03U2)^  ^  (198.9511)^ 
21  15  29  65 

=  609.1571  -  608.9li68 

=  .2106 

I       S.  Sqs.-S.  :G.  =  L^h^J^.i^^^^  .  .  .  i^^M)!  .  609.157U 

08  8 

=  610.2110  -  609.l57li 
=  1.0536 

The  analysis  of  variance  for  lines  (G' )  and  sires  within  lines  (S':G«)  is  set 
up  as  follows: 
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d^f.  S,Sqs.     lUSj^  E(MS) 
G»  2       .2051      .1026  ^     ^13  ^ll^  ^ 

S'tG         6      1.0876      .1813         k^^Q  of  +  k^^  oi^g 

Error     U8  .0522  of 

The  coefficients  for  ofjg  and  a|  om  the  S(MS)  are  com^^uted  as  follo^?s: 
1,.^    (8)^-^(8)^5)^     (8)2+(7)^     (6)^-H(8)g^(7)^^-(8)^  , 

=  ^(65  -  22.1639) 

=  |(U2.8361) 
o 

=  7.139 

1(22.1639  -WM3)-^(5)-^^M8)^, 
=  -i- (22.1639  -  7.3692) 


i.(li;.79ii7) 


=  7.397 

V     -  1           (21)^-H(l5)^^(29)"  . 
=  ^(65  -  —  ^  —  ) 

=  -^(65  -  23,3185) 

=  I  (Ul.6815) 

=  20.8I4I 

The  coefficients  for  o|,  k^^  and  k^^*         computed  froit!  the  segment  of  the 

matrix  inverse  to  the  varianca-covariance  matrix  corresponding  to  the  effects 

which  remained  after  absorption  and  the  P. _  matrices,  i.e.^ 

•'-J 


ho    ^  * 


and 


1  *  i  |soi^(P. .  -  P,.)j 
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The  P' '  matrix  was  computed  during  the  abscrption  of  the  a+gi+Sij  equations  and 
is  given  in  Table  5»    The       matrix  is  given  in  Table  8  and  the  P  matrix  is 
given  in  Table  9«    These  matrices  i^jere  comput.3d  iji  the  manner  previously  de- 
scribed from  certain  coefficients  in  the  original  set  of  least-squares  equatioB 
(Table  1)«    The  P''»P'  matrix,  after  completing  the  appropriate  subtractions  by 
rovs  and  columns^  is  given  in  Table  10 e    The  reduced  P'-P  matrix  is  given  in 
Table  11. 

The  coefficients,  k^Q  and  k^pj,  are  now  computed  as  follows? 
k^Q  -  1  +  "I  [(*060111i)(2,6256)+(-„03li394)(«6l91)  +  (.010960)(".„ij303) 

+  +  <•  (.00001503)  (7376)] 

«  1  f  i  (,7188) 
6 

=  1,120 

k^2  =  ^  ^  k  [•06011h)(«26l6)+(-.03U39ii)(.l696)  +  (. 010960) (1.3137) 

+  *  +  ( .00001503)  (I0yii85)] 
=  1+1  (ia379) 

=  1.569 

The  computed  mean  squares  may  now  be  set  equal  to  their  expectations  and 
the  variance  component  estimates  computed, 

d.f .        Mean  Squares 

G'  2  ,1026    =  1.603  a|  +  7.397  o|.g  +  20.841  a| 

S':G'      6  .1813    =  1.136  a|  +  7.339  o2.^ 

Error    48  .0509  = 

e 

.1813  -  (1.136) (.0509) 


''s^S  7.139 


-  .0173 


.1026  -  (1.603)(.0509)  -  (7 . 397) ( .0173)    ^  „.0051 
20.841 

The  estimate  of  .0173  for  o|.g  is  essentially  the  same  as  that  obtained 
from  the  complete  least-squares  analysis  where  Og.g  was  estimated  to  be  .0166 
The  difference  is  due  to  different  methods  of  estimation. 
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Table  10.    The  reduced  P»S"P«  matrix. 
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(ga)^3^ 

(ga;-j_2 

(ga)2i 

(ga)22 

A, 

b 

d 

ais 

2.62^6 

.6191 

-«ii303 

.3855 

-1.31.92 

-.0812 

-20«67 

19.114 

.8270 

■  .3855 

-.3177 

-.0812 

-.5655 

-23.2p 

11.88 

2.5063 

*r953 

1.1.683 

.10).9 

-17.36 

-2.82 

(ga)-L2- 

.8223 

.10U9 

.5700 

12.87 

1.7ii 

(ga)2i: 

1.587li 

,1286 

-II1.6O 

-33.23 

(ga),,. 

.5TU5 

lli.8l 

-9.52 

b: 

1967 

1002 

d: 

7376 

Table  11.    The  reduced  P'-P  matrix. 


^Sa)T_2_ 

(ga)2^2 

(ga)2i 

(ga)22 

b 

ft 

d 

^1- 

.2616  .1696 

1.313T 

1.C908 

.1115 

.1530 

-ii6.00 

^2- 

.1229 

•7060 

-.15U8 

-.1109 

-IO.I46 

-35.76 

(ga)iiJ 

8.2102 

6.9129 

3.iai8l 

3.1033 

61.37 

-189.36 

(ga)^^* 

i^255 

3.0?6ii 

2.7163 

-i5ii.lii 

(ga)23^J 

ii.0521 

IUO.28 

62.17 

(ga)22* 

119.09 

bs 

3528 

ds 

10,1^85 

M0DSI3  FC  H  THE  kil.hxST^  jF  DATA  HtOH  CROSSBREEDING  EXP3RIM3NTS 

The  general  least-sqrares  procedures  far  the  analysis  of  data  frora  single 
crosses  only  were  developed  by  Heiidsraon,  13/   The  desi^s  of  many  crossbreeding 
or  line  crossing  experiments  result  in  data  which  ma;^  be  analyzed  in  several 
ways.    Three  methods  of  analysis  will  be  considered  in  this  section:  first, 
the  appropriate  analysis  for  data  obtained  when  all  possible  matings  are  made 
between  li.nes  but  no  linebreds  are  produced^    second,  an  analysis  of  all  dat^ 
from  linebreds  and  linecrosses  produced  together  vfnerein  the  linebred  effects 
are  estimated  Independently  of  the  geneial  combining  ability  and  maternal 
ability  effects:  and  third,  an  analysis  of  all  data  from  linebreds  and  line- 
crosses  produced  together  wherein  the  linebred  effects  are  estimated  simul- 
taneously with  the  general  combining  ability  and  maternal  ability  effects. 


Analysis  of  Linecross  Data 

The  analysis  of  data  in  wliich  linebreds  are  not  included  will  be  con- 
sidered first. 


Model 

The  underlying  mathematical  model  for  the  analysis  when  linebreds  are  not 
included  is: 

yijk     ^  +  gi  +  -f-  Cj^j  +  r^j  +  e^jj^  .•••(?} 

where  7±^\r  is  the  k*^^-  observation  on  the  progeny  from  the  i't^  line  of  sire  and 

the  j'*'^^  line  of  dam,      is  the  over-all  mean  when  equal  subclass  numbers  exist, 
gj_(g-j)  is  the  general  combining  ability  effect  for  the  i*"^  id^^')  line,  is 

the  maternal  effect  for  the         line  of  dam,  Cj[_j  is  the  specific  combining 

ability  effect,  v^^  is  the  sex-linkage  or  reciprocal  effect  and  e^jj^  is  the 

random  error,  assimied  to  be  NID(0,o|),    The  gj^(g^)  are  assumed  to  be  one-half 

the  additive  genetic  value  (breeding  value)  of  the  i^^Cj"^^)  line,  and  the  value 
is  expressed  as  a  deviation  from  |i.    The  mj  measure  the  pre-natal  and  post- 
natal mothering  ability  of  a  line,  and  each  is  a  function  of  the  genotype  of  a 
line  rather  than  of  the  genes  transmitted  to  the  female  progeny  of  the  line. 

The  Cj^j  are  effects  in  addition  to  the  gj^  and  m .  effects,  and  are  measured 

over  the  progeny  of  the  i'th  Une  of  sire  vjith  the  line  of  dam  and  the  pro- 
geny of  the  j't'h  line  of  sire  vdth  the  i^^h  Une  of  dam.    The  c^j  measure  how 

much  better  or  poorer  than  average  are  the  means  of  reciprocal  matings  than 
would  be  expected  on  the  basis  of  exact  knowledge  of  the  additive  genetic  val- 
ues and  maternal  values  of  the  lines.    The  Vj^A  are  effects  in  addition  to  the 
additive  genetic,  maternal,  and  specific  effects  and  are  measures  of  the  diff- 
erences between  rociprocal  crosses  after  account  has  been  taken  of  the 


13/    Henderson,  C.  R,  15 u9.  Ibid, 
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differsnces  in  ma.arnal  ability  between  the  l'^^  line  and  the  j^h  Hhq^  yot  a 
more  detailed  discussion  or  these  effects,  see  Henderson's  thesis. 


Least-Squares  Equations 

The  least-squares  equations  for  model  (7)  are  nresented  in  tabular  form 
be  low  i 


Si 

m . 
0 

cij 

rij 

ne. 

i^io+n.i 

^ij 

I.» 

^ij-^^ji 

gi 

^i/"-i 

"i.j 

"ij-'^ji 

Yi.-^Y.i 

"if^ji 

o-ia° 

"la 

^ij 

"ij-'^ji 

^ij'"^ji 

nij 

'^i^r^ji 

^ij 

0 

These  equations  differ  somewhat  from  the  least-squares  equations  presented 
previously,  in  that  the  off -diagonal  elements  in  the  segment  of  the  coefficient 
matrix  concerned  with  general  combining  ability  effects  aia  not  zero.    This  is 
a  result  of  the  fact  that  each  crossbred  group  is  used  ia  estimating  the  gen- 
eral combining  ability  effect  of  two  lines.    For  example,  a  cross  between  a 
sire  of  line  1  and  a  dam  of  line  2  is  used  in  estimating  the  general  combining 
ability  of  both  lines  1  and  2,    Thus,  the  sum  of  the  coefficients  for  the  gj^  in 
the  \x  equation  is  equal  to  twice  the  coefficient  for  J.    However,  the  sums  for 
the  coefficients  for  the  mj,  Cij,  and  rij  effects  in  the  [i  equation  are  equal 
to  one  another  and  also  equal  to  the  coefficient  for  |I,    In  addition,  the 
totals  of  the  RHM's  for  all  equations  other  than  the  gj_  equal  the  grand  total 
Y,,  .    The  total  of  the  PH!-f*s  for  the  g^^  equations  is  equal  to  2Y..  ,  Before 
the  coefficient  matrix  can  be  inverted  to  obtain  the  inverse  elements  for  use 
in  making  the  appropriate  tests  of  significance,  certain  restrictions  must  be 
imposed  on  the  constant  estimates. 

Imposing  Restrictions  on  the  Constant  Estimates 

Convenient  restrictions  that  can  be  imposed  on  the  constant  estiraates  are 

The  restrictions  that  ?gj_  ^  ^mj  =  0  may  be  imposed  by  completing  subtractions 
by  rows  and  columns  as  expiated  p>^viously.    The  subtractions  ajid  additiont' 


required  to  xnrpose  the  restrictions  on  the  estimates  for  the  specific  effects 

(2c^  ^  =  Zc;!4  =  Z  Zc^  ^  =  0)  will  be  given  for  the  specific  design  of  all  possitQe 

crosses  among  four  lines. 

If  the  numbers  in  the  subclasses  concerned  T-d-th  the  measurement  of  spe- 
cific effects  are  represented  by: 

^12     ^^13  "^lii 

""31     ''32  ''31; 
"lil     %2  %3 

then  subtraction  of  the  last  column  in  each  row  from  all  other  coefficients 
results  in: 

^12""lU  "l3*""ll;  "lii""ll; 

^2l"''2U  ^3"''2U  "2U"^2li 

^3r^3li       ^32-^3U  ^3U'"3it 

%1%3       "U2"%3  ^3^3 

Subtraction  by  row  then  jrields: 

'^12-"llt-nii2+nU3  "I3*"^lli 
n2i-n2[;-ni^l+n^3  n23'''^2l; 

^3l""3rtl''%3  "32"^3U'%2'"%3 

Since  the  specific  effect  for  a  particular  cross  is  measured  over  the  two 
reciprocal  crosses,  these  values  must  be  combined  as  follows: 

"12-niii-n^2+n^3+n2i-n2^-n^+n^3  ^iyT^iU*ri3r^3hr''hl''%3 

^3''''2li"'^32"^3U"''i|2"'^ii3 

or      n.2.n^-n.j.2n^3  "13-^11,-^31^^3 
43-4l-"3U""U3 
where    n!^^  =  ^2''"2l'    ''lU  '  "lU*V* 

restriction,  Z  Zcj^ j  =  0,  results  in  the  following  equations: 

-12^. 


The  third 


and      nl3-nii,-n3^-nj^3-n'3-n.|^-n3j^-nj^3  -  n-j-nj^^-n- ^^n'^  (9) 

The  restriction  on  the  equations  for  the  reciprocal  crosses  that  Z^±^ 
2r..  «  0  results  in  subtraction  by  row  and  column  by  the  same  procedure  as 

followed  for  the  specific  effects.  The  restriction  that  the  reciprocal  effects 
for  each  cross  must  sum  to  zero  (r^^  +  r^^  =  0)  yields  the  following  equations: 

'^i2''^iir%2^\3''^2i'^ii'^hi'%3 '  ^i2"^i'r"i2"^i''^h''"ia  *  •  ^-^^^ 

"l3""lli"^3l'Vl;lA3  '  ^3'^"^31*Vl;l"%3  '  * 

"23""2r"32*W"^3  °  "23-"2U-^32 VU2-\3  '  '  ^''^ 

The  subtractions  and  additions  outlined  in  equations  8-12  are  those  that  are 
completed  for  the  specific  combining  ability  effects  and  the  residual  reciprocal 
effects  in  the  u  equation.     The  subtraction  and  additions  among  the  coefficients 
for  these  effects  in  all  other  least-squares  equations  follow  the  same  form  as 
given  in  the  above  equaticns.     The  procedures  outlined  in  these  equations  are 
also  completed  in  the  RIIM's  for  the  specific  and  residual  recinrocal  effects, 
using  the  appropriate  sums  in  order  to  obtain  the  reduced  RHN's. 

The  equations  for  :he  subtractions  required  to  impose  these  restrictions 
on  the  specific  and  residual  reciprocal  effects  will  differ  deoending  on  the 
number  of  lines  involved.    However,  the  procedures  shown  above  can  be  utilized 
to  determine  the  appropriate  equations.     After  the  coefficient  matrix  has  been 
reduced  there  will  remain  a  symmetric  matrix  of  order  p(p-l)  ,  v;here  p  =  the 
number  of  lines  included  in  the  study. 


Inversion  of  the  Reduced  Matrix 

The  size  of  the  resulting  least-squares  matrix  is  normally  so  large  as  to 
make  its  inversion  by  means  of  a  desk  calculator  extremely  laborious  if  not 
prohibitive,    Programs  are  re-dily  available  for  the  inversion  of  matrices  by 
high  speed  computing  equipment,    1'Jhere  the  size  of  the  matrix  dictates  such 
action,  partitioning  of  the  matrix  may  be  required. 


Computing  the  I/Sast-Square  Means  and  Standard  Errors 

The  equations  may  be  solved  directly,  at  the  same  time-  as  the  inversion  of 
the  matrix,  in  order  to  obtaiu.  estimates  of  the  constants  |i,  g^,  mj,  c^^^  and 

Tj^j,    How3ver,  it  is  sometimes  more  convenient  to  obtain  the  constants  from  the 

inverse  elements  and  the  RIM's  of  the  equations  by  means  of  the  formula  pre- 
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viously  presented, 

ZC^^Y.  »  Si 
J 

where  C^j  is  the  inverse  element  for  the  i'^h  row  and  J^h  col.mon  of  thB  inverse 
matrix,  Yj  is  the  RHM  for  the  j'th  row  and       is  the  estimate  of  the  1"^^  con- 
stant.   The  constant  estimates  for  the  model  under  discussion  will  include  the 
least-squares  mean  and  the  estimates  of  the  general  combining  ability  effects, 
the  maternal  effects,  the  specific  combining  ability  effects  and  the  reciprocal 
effects  expressed  as  deviations  from  the  least-squares  mean* 

In  order  to  estimate  the  standard  errors  of  the  mean  and  othttr  constant 
estimates  it  is  necessary  to  estimate  the  error  mean  square,  o|,  which  is  given 

by 

^V^TTT^  j^Z22y|-3k-Rax,g,m,c,r)j 

where  — — »  degrees  of  freedom  for  error, 
n.,-p(p-l)        ^  ' 

2 

22Z  y.;  •;v.  =  total  uncorrected  sum  of  squares, 
ijk  "^"J^ 

and   R(|j,,g,m,c,r)    »  reduction  in  suj/i  of  squai'es  due  to  fitting  all  constants© 

The  reduction  due  to  fitting  all  constants,  R(iJ,,g,m,c,r)  can  be  computed 
from  all  constant  estimates  and  the  original  set  of  RHM's  as  follows  J 

R(^i,,g,m,c,r)  »  (iY,.  +       (Y^^  +  Y.^)  +  .  .  «  *  +?pp  Ypp. 

The  reduction  due  to  fitting  all  constants  can  more  easily  be  calculated  from 
the  constant  estimates  obtained  from  the  solution  to  the  reduced  matrLx  to- 
gether with  the  reduced  RHM's. 

The  standard  error  of  the  mean  is  obtained  from 

-  V'  cm 

where  Ci^  is  the  diagonal  inverse  element  corresponding  to  the  p,  constant©  In 
order  to  estimate  the  standard  errors  for  all  the  remaining  constants  it  is 
necessary  to  complete  the  inverse  matrix  for  the  equations  that  were  subtracted 
out  when  imposing  the  restrictions  on  the  constant  estimates.    Once  the  matrix 
has  been  completed,  standard  errors  for  constant  estimates,  or  linear  functions 
of  constant  estimates,  can  be  calculated  as  explained  in  the  first  two  se.tioi^ 


Coii5)uting  Sums  of  Squares  for  the  Analysis  of  Variance 

The  sums  of  squares  for  the  g^,  mj,  c^j,  and  r^j  can  be  obtained  directly 
using  the  formula  B'Z~1b  where  Z"^  is  the  inverse  of  the  segment  of  the  matrix 
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inverse  to  the  variance-covariance  matrix  corresponding  to  the  particular 
effects,  and  B'  and  B  are  the  constant  estimates  for  these  effects.    The  s-ums 
of  squares  may  also  be  computed  indirectlj--  by  differences  in  reductions  after 
obtaining  the  solutions  to  other  sets  of  equations. 

In  this  analysis  the  sums  of  squares  could  be  gotten  as  follows: 

■Source  of  variation  Sums  of  squares 

General  combining  ability  HC[i,g,m,c,r)  -  R(ia,m,c,r) 

Maternal  ability  P-(|x,g,m,c,r)  -  R(ia,g,c,r) 

Specific  combining  ability  R(l-i->g*m,c,r)  -  R(ii,g,m,r) 

Residual  reciprocal  effects  R(^L,g,in,c,r)  -  R(|j.,g,m,c) 

The  sum  of  squares  for  error  can  be  computed  as 

^^iik  "  f^(M->g^m,c,r) 

as  previously  noted.    Since  all  of  the  variability  among  the  subclasses  is 
accounted  for  by  the  constants  being  fitted,  the  total  reduction  in  sum  of 
squares  can  be  obtained  by  calculating 
Y?  . 

-r-;  =  reduction  due  to  fitting  all  constants. 

The  degrees  of  freedom  for  the  model  described  are  calculated  as  shown 
below: 

v^ource  of  Va-^\ation  Degrees  of  Freedom 

Total  n,,  -1 

General  combining  ability  p-1 

.  Maternal  ability  p-1 

Specific  combining  ability  p(p-3) 

Residual  reciprocal  effects  p(^"3)  ^1 

Error  no.-p(p-l) 

This  it  can  be  sesn  that  with  only  three  lines  included  in  the  study, 
there  ;;ould  be  no  degrees  of  freedom  available  for  the  estimation  of  specific 
combining  abilities,  and  only  one  degree  of  freedom  available  for  the  esti- 
mation of  residual  reciprocal  effects. 

Upon  completion  of  the  analysis  of  variance,  it  can  be  determined  which 
effects  are  significant  and  should  be  retained.    If  some  of  the  effects  do  not 
appear  signifi-^ant,  the  equations  for  these  effects  can  be  deleted  from  the 
model  aid  more  efficient  estimaices  of  the  remaining  effects  obtained.    The  in- 
verse of  the  variance-cov?.riance  matrix  when  one  or  more  sets  of  effects  have 
been  eliminated,  C^l,  is  given  by 

=       -  R?-iR« 
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where  Z  is  tho  segment  of  the  matrix  inverse  to  the  variance-covariance  matrix 
corresponding  by  row  and  column  to  the  effects  being  eliminated,  H(R')  is  the 
segment  of  the  matrix  inverse  to  thp.  varinnce-covariance  matrix  associating 
these  effects  .rith  the  effects  to  be  retained,  and        is  the  segment  corres- 
ponding by  row  and  coluinn  to  the  effects  being  retained. 

The  new  constant  estimates  of  the  effects  to  be  retained,  Bj^,  can  be 
obtained  by  means  of  the  formula 

B^  -  B2  -  RZ-lBp 

where  B2  represents  the  constant  estimates  for  the  effects  to  be  retained^ 

before  adjustment,  and  B-j^  represents  the  constant  estimates  of  the  effects 

being  eliminated.    The  definitions  of  R  and  Z  are  given  above.    The  new  con~ 
stants  can  also  be  obtained  by  omitting  the  RHM's  of  the  effects  being  elim- 
inated and  multiplying  the  TlKI''s  by  each  row  of  the  new  matrix  jjiverse.  The 
elements  of  the  new  matrix  inverse  and  the  new  constant  estimates  are  those 
which  would  have  been  obtained  had  the  effects  wliich  were  eliminated  not  been 
included  in  the  original  model. 


Estimation  of  Variance  Components 

If  all  effects  are  assumed  to  exist  and  t>©  lines  are  regarded  as  random, 
the  expectations  of  the  mean  squares  are  as  shorn  below: 

Source  of  Variation  E(MS) 


The  k]_,  k^D  k^p  and  kg  values  can  be  calculated  by  the  direct  procedure 
explained  in  the  section  discussing  the  two-way  classification  -tdthout  inter- 
action.   The  k2,  kj^^,  k^,  and  ky  values  car.  be  calculated  by  the  indirect  pro- 
cedure explained  iji  the  sarae  section.    The  variance  component  estjjnates  are 
computed  by  setting  the  above  expectations  equal  to  the  computed  mean  squares 
and  solving  the  resulting  equations. 


General  combining  ability 

of  * 

k6<^ 

Maternal  ability 

Specific  combining  ability 

Residual  reciprocal  effects 

Error 

of 

Numerical  Example 

The  computational  procedures  outlined  above  for  setting  up  the  least- 
squares  equations,  imposing  restrictions  on  the  equations,  obtaining  sums  of 
squares  and  degrees  of  freedom  to  complete  the  analysis  of  variance  and 
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estimating  the  components  of  variance  are  illustrated  "with  a  set  of  single 
cross  data  involving  four  lines.    The  subclass  numbers,  totals,  and  means  are 
presented  in  Table  12. 


Table  12. 

Subclass  numbers 

,  totals. 

and  means 

Line  of 

Line 

of  Dam 

—  Sums 

Sire 

1 

..  ^ 

3 

\x 

1 

Nurber 

Total 

Mean 

22 
1231.0 

12 
767.2 
63.933 

10 
621.0 
62.100 

12 
65.625 

56 
3aOo. ( 

c 

Number 
Total 
Mean  ' 

12 
712.0 
59.333 

26 
1725«0 
66.3i;6 

10 
597. Ii 
59.7I1O 

10 
olO.!? 
61.050 

58 

3 

Number 

Total 

Mean 

12 
771.3 
bli.275 

12 
773.9 
6U.U92 

12 
61i5.5 
53.792 

12 

•700  0 

66.658 

U8 

Number 

Total 

Mean 

111 
860,8 

61.I186 

8 

U59.2 

57.ii.00 

12 
5o.li5o 

12 
731.6 

60.967 

he 

2657.0 

Number 
Sums  Total 

60 
3575.1 

58 
3725.3 

2li69,3 

lib 
2929.5 

208 
12699.2 

These  data  are  the  28-day  weights  of  rats  resulting  from  an  experiment  by 
Kidwell,  et  al.ity     The  observations  are  the  sex  by  litter  means  and  only  those 
litters  with  rats  of  both  sexes  are  included  in  the  analysis.    Therefore,  the 
sex  effects  are  orthogonal  to  the  subclass  differences.    The  least-squares  co- 
efficient matrix,  using  the  model  previously  presented  in  this  section,  is 
shown  in  Table  13.    It  can  be  observed  that  the  segment  corresponding  to  the 
rovjs  and  columns  to  the  general  combining  ability  is  filled  in  completely, 
whereas  the  segments  corresponding  by  row  and  column  to  the  other  effects  have 
entries  only  on  the  diagonals.    In  addition,  the  RHM's  for  the  general  combin- 
ing ability  do  not  equal  the  suin  for  the  \i  equation. 

The  least-squares  coefficient  matrix  is  then  reduced  by  imposing  the 
restric\.ions  previously  presented  and  this  reduced  matrix  is  shown  in  Table  II4. 

If  the  equations  are  solved  at  the  same  time  the  matrix  is  inverted,  by 
including  the  RKl'»s  in  the  equations,  then  the  constant  estimates  are  obtained 
directly.    If  the  solution  to  the  equations  is  not  computed  directly,  the  con- 
stants can  be  calculated^ from  the  inverse  elements  and  the  Rilled' s  of  the 
equations,  since  2C'^^Y-=  c- 

as  noted  previous ly^   ^    

lii/    Kidwell,  J,  F.,  H.  J.  Weeth.,  W,  R.  Harvey,  L.  Ho  Haverland,  G.  E. 
Shelby,  and  R.  T*  Clarke    Heterosis  ra  crosses  of  jjibred  lines  of  rats. 
Genetics  U5s 2 25-231.  I960. 
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Table  Ih*    Reduced  least-squares  equations  and  RHM's. 


Si 

g2 

^1 

A 

C]_2 

C13  ri2  H3 

PPM 
runrl 

y- 

g3 

r23 

[I  : 

136 

ii 

-ii 

0 

1; 

-2 

-2 

0 

-8 

U 

0 

-U 

8366.1 

Si  • 

~T 

88 

1;8 

lo 

i|6 

26 

20 

0 

li 

-2 

-2 

2 

396.^ 

g2  ' 

-h 

UH- 

96 

U8 

22 

I18 

22 

U 

-2 

-2 

-2 

-203.1 

ml  : 

0 

Uo 

88 
20 

20 

26 

li2 

ii 

8 

-II 

-u 

0 

li5.6 

k 

22 

72 

31 

3U 

-2 

0 

li 

2 

-2 

1U6.2 

: 

-2 

26 

m 

26 

3ir 

66 

3u 

0 

-2 

6 

0 

-6 

-197.6 

m3  : 

-2 

20 

22 

h2 

3h 

3ir 

66 

2 

2 

2 

_o 

-U 

-37U.1 

0 

0 

ii 

k 

-2 

0 

T 

96 

li8 

-2 

-2 

2 

-135.1 

C13: 

-8 

h 

8 

0 

-2 

2 

US' 

88 

0 

-ii 

0 

-557.6 

1:12: 

^23* 

h 

-2 

-2 

-h 

ll 

6 

2 

-2 

0 

68 

26 

-18 

279.8 

0 

-2 

-2 

-U 

2 

0 

-2 

—2 

-U 

72 

2U 

117.5 

-1; 

2 

-2 

0 

-2 

-6 

-h 

2 

0 

-18 

211 

6k 

-133.3 

The  inverse  elements  for  the  above  matrix  are  presented  in  Table  13^  together 
vdth  the  constant  estimates.    The  total  redixction  in  sum  of  squares  due  to  fit- 
ting all  constants  can  be  obtained  by  multiplying  the  column  vector  of  all 
RHM's  by  the  roxv  vector  of  all  constant  estimates.    The  sane  result  is  obtained 
by  using  the  constants  estimated  from  the  reduced  matrix  and  the  corresponding 
reduced  RHM's,    For  example, 

R(ix,g,m,c,r)  =  (6l.378)(8366a)+(2.9Ul)  (396.5)+   +   +  (1.229)  (-133.3)=  517,127 

This  reduction  in  sum  of  squares  can  also  be  obtained  from  the  subclass 
numbers  and  totals  given  in  Table  12,  as 

,      (767.2)2      (621.0)2                 (605.1.)=^  _ 
R(^i,g,m,c,r)  =   +  —-^        +   +   +   =  517,131 

Since  only  three  significant  digits  were  carried  in  some  of  the  cons tan'  esti- 
mates the  two  answers  do  not  agree  exactly. 

The  next  step  is  to  calculate  the  error  sum  of  squares.  Nonnally  this  is 
obtained  in  a  least-squares  analysis  from 

2 

ZZ2  yijj^  -  R(all  constant."). 

ijk 

However,  in  these  data  the  sex  and  sex  by  subclass  effect  would  remain  in  the 
error.    The  sum  of  squares  for  error,  therefore,  was  obtained  by  also  subtract- 
ing the  corirected  S!iTt?s  of  squares  for  sex  and  sex  by  subr*lass  from  the  total 
uncorrected  sum  of  squares.    The  resulting  error  sum  of  squares,  with  136-2U  = 
122  degrees  of  freedom,  was  found  to  be  5,786,3. 

Since  the  differences  among  the  rj^^  were  small,  and  there  is  little  bio- 
logical basis  to  expect  these  effects  to  exist,  a  test  of  significance  was  made 
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to  determine  whether  this  variable  could  be  deleted.     The  adjusted  sum  of 
squares  for  R,  the  residual  reciprocal  effects,  is  obtained  from  the  general 
formula  B'Z-1b  as  follows: 


S. 


=  E767  .981i  1.229) 
=  [^5.3305  119.U853  88.03U6] 


.767 

.98U 

[ 

1.229_ 

n 

"  .767^ 

.98U 

1.229 

.767 
•98U 
1.229 


268,21 


The  test  for  the  significance  of  differences  among  the  rj_j  is  then  complet- 
ed as  follows: 


Source  of  Variation 

Residual  reciprocal  effects 
Error 


d_^^       Sum  of  Squares    Mean  Square  F 

1.89 


3 
122 


268.2 
5786.3 


89.  A 
A7.4 


Since  the  residual  reciprocal  effects  were  non-significant  they  were  deleted 
from  the  analysis  in  order  to  obtain  more  efficient  estimates  of  M,gj,mj^  and  sij 
the  inverse  elements  of  Table  lA  were  adjusted  by  use  of  the  formula 

The  RZ^-4l'  matrix,  which  was  subtracted  from  the  equations  in  the  segment 
corresponding  to  the  remaining  effects  is  presented  in  Table  I6. 


Table  I6.    The  RZ^^'  matrix  used  to  delete  the  r^y  (xlO^) 


1^ 

SI 

§2 

§3 

^1 

m2 

m3 

ci2 

^13 

28 

-33 

-23 

-31 

8 

103 

20 

-28 

27 

"33 

127 

-35 

U8 

-91 

^91 

-3 

33 

28 

-23 

^ 

101 

35 

17 

-lli9 

0 

23 

-63 

"31 

"IF 

hh 

-35 

"130 

6 

31 

-22 

8 

-91 

17 

-IF 

112 

35 

-k6 

-3 

-51 

103 

-91 

-li^9 

-130 

"3F 

iilil 

51 

-103 

121 

20 

-3 

0 

6 

-ii6 

TI 

50 

-20 

37 

-"28 

33 

23 

31 

-8 

-103 

28 

-27 

27 

28 

-63 

-22 

-51 

121 

37 

68 
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These  valued  were  subtracted  element-by-element  from  those  in  the  segment  for 
the  remaining  effects,  resulting  in  thf^  new  inverse  matrix  of  the  variance- 
covariance  matrix  with  the  rj_j  effects  deleted.    This  matrix  is  pressnted  in 

Table  17. 

Table  17.    The  inverse  matrix  (C^^)  of  the  variance-covariance 
matrix  with  the  r^j  effects  deleted*  (xlO""^) 


Si  ' 
g2  ' 
g3  ' 
mi  ! 

m2  ! 
m3  ! 

C12' 
C13! 


v- 

gl 

g2 

g3 

mi 

m2 

^3 

C12 

ei3 

7U70 

"kOl 

6S8 

-U53 

-603 

71 

67I4 

-526 

982 

25'52]4 

-93I1O 

-69I49 

-16576 

5039 

5732 

Uoi 

»375 

668 

25i;9ii 

-855h 

608U 

-17559 

5729 

-668 

-61 

-h53 

-69liS> 

2iill9 

5615 

i;557 

-15619 

U53 

-131? 

-603 

-16576 

608i4 

31ii36 

-10li52 

603 

-395 

71 

5039 

-17539 

U557 

35U96 

-11509 

277 

1268 

67U 

5732 

5729 

-15619 

-nfn? 

33283 

-67ii 

310 

-526 

ItOl 

-668 

li53 

603 

277 

lijlil5 

^7926 

982 

-375 

61 

-1317 

-395 

1268 

310 

35937 

Since  the  r^j  were  assumed  to  equal  zero,  these  RHM's  were  omitted  and  the 
new  constant  estimates  obtained  as  before^  by  multiplying  the  RHM*s  by  each  row 
of  the  new  inverse.    These  constants  are  shown  belowt 

10-6  ^fi^jo)  (8366.1)  +  (-iiOl)  (396.5)+  +  +(932)  (-557.6)j  -  61.35 

g^^  -  2.9U  -  -1.53  -  -1.33 

g2  --1.3U  -    1.38  »  -.22 

g^  «  2.73  ra^  -  -5.89 

The  remaining  constants  (g||>^j^*^l]^>^23*^2l;'^3li^        ^®  obtained  by  differences^ 
since  gi+g2+g3"*"gl^''  mi+in2+m3+m^''  0  and  the  c^^  sum  to  ssero  by  row  and  column  and 
p-1^ 

for  ZTiCa  ;» 

ij 

Tlie  reduction  in  sum  of  squares  due  to  fitting  M-^g*"^*  ^  ^®  then  calcu-° 
lated  by  multiplying  the  reduced  RHM's  by  the  corresponding  constant  estimates^, 
as  f  olJIows 

H(l^,g,m,c)  -  (6l.35)(8366a)+(2.9l*)(396.5)+  +  +  (*22)(-557.6)  -  5l6,li72/, 

Normally,  this  reduction  in  sum  of  squares  would  be  subtracted  from  the 
total  uncorrected  sum  of  squares  to  obtain  a  new  error  term.    However,  since  a 
difference  in  sexes  was  also  involved  in  this  analysis,  the  error  term  computed 
from  among  the  litters  within  the  sex  by  cross  subclasses  was  used. 
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Completion  of  the  Analysis  of  Variance 

In  order  to  complete  the  analysis  of  -variance,  it  is  only  necessary  to 
compute  the  sums  of  squares  for  the  general  combining  ability  effects,  maternal 
effects,  and  the  specific  combining  ability  effects.    The  sum  of  squares  for 
the  general  combining  ability  effects  is  computed  as  follows : 

S.  Sqs.  G  =[2.9U    -1.3U    2.73]r.025^2lt    -.0093UO  -.0069U9 


0093iiO 
,006919 


.025U9li 
►00855U 


-.008551 
.021119 


'1.3h  2, 


,'4003 


30.3li87 
60J4735 
30.1920 


303 
[27.I4C 

[203.9050    90.6118    20ii.0719]  r^*9h\ 


27.)4003 
30.1920 
60.0631 


=  1035.2 

The  sum  of  squares  ."or  the  maternal  effects  is  computed  in  a  similar 
fashion: 


S.  Sqs. 


-M  =[-1.53 


1.38 


•5.89] 


031I136  -.010U52  -.011U12 

010U52  .O35I496  -.011509 
01ll|12    -.011509  .033283 


^1.53   1.38  -5.89j 

[2: 

^183.7916   -IOU.3031  -276,2250) 


21.8113 
Ul.69l|2 
2i.896ii 


23.9123 
21.896U 
U5.8160 


-1.53 
1.38 
-5.89 


=  1763.5 

The  sum  of  squares  for  specific  combining  ability  is  computed  similarly: 
S.  Sqs.  ( 


1-1.33  -.22]  r  .011Utl5  -.007926" 
^  [^,007926  , 


>0l5937 


=  [-1.33   -.22]  r95.U826 


17. '4867 

86 » 3639 


-1 


(i37.)4389  82,157)3^-1.33] 


200.9 


The  completed  analysis  of  variance  is  presented  below: 


Source  of  V ariation 


d.f. 


Sum  of  Squares     Mean  Squares 


General  combining  abilities  (G)  3 

Maternal  abilities  (M)  3 

Specific  combining  abilities  (C)  2 

Error  122 


1035.2 
1763.5 
200.9 
5736.3 


3U5.1 


loo.U 

ii7.U 


Since  the  specific  combining  ability  effects  are  not  significant,  these 
equations  could  be  deleted  from  the  model  as  were  the  equations  for  the  sex- 
linkage  effects.    Assuming  the  specific  combining  ability  and  the  sex-linkage 
effects  to  be  zero^  the  mean  square  for  error  is  the  appropriate  term  for  test- 
ing the  significance  of  the  general  combining  ability  effects  as  well. as  the 
maternal  ability  effects  xi/hen  a  random  model  is  considered.    Should  the  line 
effects  be  considered  fixed,  as  in  crossing  breeds  of  cattle  where  similar 
samples  of  the  breeds  could  be  selected  for  repeated  experiments,  the  mean 
square  from  the  error  line  would  be  appropriate  for  tasting  all  effects. 


Estimation  of  Variance  Components 

Variance  component  estimates  can  be  calculated  readily  in  this  analysis  by 
making  use  of  the  inverse  elem.ents  of  the  segments  which  were  inverted  to 
obtain  the  sums  of  squares    to  calculate  the  coefficients  for  the  variance 
components.    The  coefficient  for  the  major  variance  component  in  each  line  of 
the  analysis  of  variance  is  computed  by  means  of  the  general  formula 


presented  in  the  section  on  the  two-way  classification  without  interaction,  and 
again  not  to  be  used  for  interaction  variance  components. 


k  = 


JL  

17M  -  d.f.  EZ7>j 
i  i-i> 


m 


The  expectations  of  the  mean  squares  are 


shown  below. 


Source  of  variation 


d.f. 


E(MS) 


Maternal  abilities  (M) 


General  combining  abilities  (G) 


3 


3 


Error 


Specific  combining  abilities  (C) 


122 
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utilizing  the  inverse  elements  of  the  matrix  which  was  used  to  calculate  the 
sum  of  squares  for  specific  effects, 

181.8U6^  -  I47.I1867 
=   =  22.39 


100.'4  -  U7.U 


and       3?  =   ^  =  2.U  . 


22.39 


Performing  the  same  calculations  with  the  inverse  elements  of  the  matrix 
used  to  calculate  the  sum  of  squares  for  maternal  effects, 

,     _  135.2^38  -  I45.O8OQ  ^, 
-  ^  "  '^'^•pU 

.      ^     ^87,8  -  hl.h     ^,  ^ 
and       05^  =   -1         =  2U.0  * 

22. $h 

The  inverse  elements  used  in  the  computation  of  the  sum  of  squares  for  the 
general  combining  abilities  are  then  used  to  calculate  k^,  as 

178.2813-58^6273 

\   1;  "  29.91 . 

The  coefficient  for  the  component  of  o§  appearing  in  the  £(143)  for  the 
general  combining  abilities,  k-^,  may  also  be  determined  from  the  matrix  used  to 

compute  the  s\im  of  squares  for  general  combining  abilities,  when  all  subclasses 
are  filled.    The  formula  for  calculating  this  value  in  this  case  is 

i  ij, 


k 

Utilizing  this  formula. 


m(m-2) 


I 


and 


178.2813  -  58.6273 
^3  r-  =  li;.96 


3ii-5.1  '  hl.h  -  (lii.96)(2.U). 
29.91 


Mean  Separation 

Since  significant  differences  were  found  among  the  maternal  effects,  mean 
separation  procedures,  such  as  Duncan's  Multiple  Range  Test,  can  be  applied  to 
determine  which  means  differ  significantly  from  one  another.    If  the  lines  or 
breeds  were  considered  as  fixed,  significant  differences  \TOuld  a3.so  have  exist- 
ed among  the  general  combining  ability  effects.    Since  the  mean  separation 


procedures  "Urf  the  saraa  for  all  effects^  the  test  be  used  only  for  the 

general  conibining  abilities.    The  computational  details  regardirig  this  test 
were  discussed  in  both  the  first  and  second  sections,  so  the  results  of  this 
test  are  presented  in  Table  18  without  further  explanation. 


:able  18 


Mean  sepairation  for  general  combining  ability 

»».^  Multiple  Range  Tes±„(.0^  level) 


Comparison  if±-f^)  V^^jjjj.gcij 


Product 
dljfferences 


3|  Epn2 


vs 

S3 

,21 

5o6lO 

1.18 

19.28 

% 

rs 

h. 

if28« 

5.357 

22.93 

20e31 

TS 

7.27* 

5.366 

39.01 

21*00 

TS 

ii«07* 

5.h75 

22,28 

19.28 

h 

h 

vs 

h 

7.06* 

5.i;72 

38,63 

20,31 

h 

VS 

h 

2,99 

$Jx99 

19.28 

Uae  of  the  Transformation  Matrix 

The  "wrriting  out  of  the  coefficient  matrix,  the  subsequent  subtractions 
performed  in  imposing  restrictions  on  the  constant  -astiniates,  and  the  fijial 
inversion  of  the  matrix  is  a  time-consuming  process.    Much  of  the  work  can  be 
eliminated  if  a  transformation  matrix  is  available^  or  can  be  witten  out 
quickly,  as  is  possible  with  marQr  designs.    Advantage  can  then  be  taken  of  the 
faco  that  the  inverse  of  the  matr5jc  of  subclass  numbers  (a  diagonal  matrix)  can 
be  obtained  readily  as  this  is  nothing  more  than  the  reciprocals  of  the  sub~ 
class  numbers.    In  addition,  the  constants  are  merely  the  subclass  means. 
Transformation  of  the  inverse  of  the  subclass  numbers  then  yields  the  inverse 
iTiatrix  that  would  have  been  obtained  imder  the  methods  outlined  in  this  section. 
This  transformation  is  accomplished  by 

where  K  «  the  transformation  matrix,  D~l  «  the  inverse  to  the  matrix  of  sub- 
class numbers,  and       is  the  transpose  of  the  transformation  matidx.    Trans «• 
formation  of  the  constant  estimates  (the  subclass  means)  then  yields  the  con- 
stant estimates  that  would  have  been  obt-ained  by  the  methods  of  this  section. 
Transformation  of  the  subclass  constant  estimates  is  accomplished  by 

KB 

where  K  is  the  transformation  matrix  and  B  is  the  vector  of  subclass  means. 

The  transformation  matrix  for  the  model  under  discussion  and  the  general 
fomulas  for  writing  out  the  transformation  matrices  for  single  cross  data 
involving  more  than  four  lines,  when  the  linebreds  have  not  been  included,  have 
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been  given  by  Karvey.l^/  The  transformation  matrix  for  the  model  under  discus- 
sion, the  subclass  means,  the  reciprocals  and  the  transfoimed  constant  estimates 
are  presented  in  Table  19. 

Table  19.    The  transformation  matrix:  (K),  the  subclass  means  the  reciprocals  of 
the  subclass  numbers,  and  the  transformed  constant  estimates. 

Constant 
Estlinates 


si2Sl3SiI;S2iS23S2hS3lS32S3UsUlSU2SL.3  Means 


Reciprocals 


1^ 

'2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

63.933 

.0833333 

61.378 

6 

6 

6 

0 

-3 

-3 

0 

-3 

-3 

0 

-3 

-3 

62.100 

.1000000 

2.9iil 

0 

-3 

-3 

6 

o 

6 

-3 

0 

-3 

-3 

0 

-3 

65.625 

.0833333 

-1.29U 

-3 

0 

-3 

0 

-3 

6 

6 

6 

-3 

0 

59.333 

.0833333 

2.753 

-6 

— o 

-6 

U 

n 

o 

u 

n 
u 

i^o  7lin 

♦xuuuuuu 

m^T 

6 

0 

0 

-6 

-6 

-6 

0 

6 

0 

0 

6 

0 

6l,050 
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The  transformed  constant  estimates  shoim  in  the  last  column  of  table 
were  computed  as  follows: 

-  (2)(63.933)  ^  (2)(62.100)  +  +  +  (2)(50.l!50) 
,  =   _   _ 


61.378 


Si 


(6)(63.933)  +  (6)(62.100)  +  ^   -  (3)(5o.l!50) 
21; 


2.9l;l 


'23' 


(3)(63.933)  "  (3)(62aao)  +  +   -  (3)(50.l!5o) 

2U 


1.229 


These  constant  estimates  are  the  same  as  those  presented  in  table  i;  snd  thus 


15/   Harvey,  Walter  R»    Analysis  of  data  with  unequal  subclass  nombers 
when  a  set  of  orthogonal  comparisons  among  the  subclass  means  If  desired. 
Paper  presented  in  the  Biometrics  Society  Section  of  the  AIBS  meetings  at 
Pennsylvania  State  University,  Aug-dst  31,.,  1959. 
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are  the  same  as  would  have  been  obtained  by  the  T.ethods  outlined  previously  in 
this  section.    The  inverse  elements  are  obtained  as  follows: 


C^l^  ^  (2)^(*0833333)+(2)^(aOOOOOO)  -^(2 )^( ,0833333)       ^  ^^^^g 

C^&l  =  (2)(6)(,c633333)^(2)(6)(aQQQOOO)4-         ^  (2)(3)(,0833333)  ^  „^^,oq^3^ 

(210^ 

Cgigi  =  (^)^(-^833333)^(6)^(aO000O0)  4-    (-3)^(>C8 33333)        ^  .02^6^1 

576 

Cgigi  =  (^)(-3)(»10000QO)-(6)(3)(.0833333)  +  +  +(3)(3)(>0833333)  ^  .^00^3^5 
etc. 

While  the  transformed  inverse  matrix  C"^  can  be  calculated  on  a  desk 
calculator,  these  matrix  multiplications  can  be  completed  readily  on  high 
speed  computers,  saving  considerable  time.    Upon  completion  of  the  inverse 
matrix,  the  analysis  would  proceed  as  outlined. 


SimTAK^CUS  AKAl^ySIS  CF  TATA  FROK  CROSSES  AND  PUREBREPS  I 

When  it  is  possille  to  include  linebreds  or  purebreds  in  the  design  of  a 
single  cross  experiment  an  estimate  of  heterosis  is  then  available.    The  line- 
bred  effects  ma-i'-  be  estimated  independently'-^  of  or  siirailtaneouslA'  with  the 
generaJ  combining  abilitv  and  maternal  effects.    The  analysis  to  be  considered 
in  this  section  includes  the  linebreds  in  the  experiment  but  estimates  these 
effects  independentlv  of  the  general  combining  abilitv  and  mi3ternal  effects. 


Model 

The  mathematical  mcriel  underlying  this  anal^'-sis  is 

yhijk  =  (1  +  PT^ii  H.         -f  g2j  +  m2j  +  0.3^^  +  r2ij  ^  ehi^ik  ....  (8) 

where  yhijk  ~  'the  k^^'  observation  on  the  progeny  of  a  mating  of  a.  dam  from  the 
j't^'h  line  with  a  sire  of  the  ith  Une  ir  the  hth  tvpe  of  breeding  (purebred  0- 
crossbred);  |i  =  the  over-all  m.ean  when  equal  subclass  freouencies  exist  and 
thepe  are  equal  frequencies  of  linebreds  and  crossbreds;  ah  ~  an  effect  com- 
mon to  all  progenv  of  the  h"^^  tvpe  of  breeding  (linebred  or  crossbred),  the 
difference  between  these  effects  being  an  estimate  of  heterosis;  and  Pi±±  ~ 
an  effect  com.!Ton  to  all  nrogenv  of  a  mating  between  a  dar  of  the  i'th  line  and 
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a  sire  of  the  i"*^^  line.    The  reiraining  svmbols  represent  the  same  effects  as 
described  in  model(7).    The  subscript  1  denotes  that  the  effects  are  measured 
only  among  the  Drogenv  of  the  first  type  of  mating  (linebred)  and  the  sub- 
script 2  denotes  that  the  effects  are  measured  only  among  the  progeny  of  the 
second  type  of  mating  (crossbred). 


Least-Squares  Equations 

The  least-squares  eq^iatiors  for  this  model  are  presented  in  Table  20.  It 
can  be  seen  from  these  equations  that  while  the  sums  of  the  coefficients  for 
the  heterosis  effects  equal  the  coefficient  for  \i  ir  the  [i  equation,  none^of 
the  sums  of  coefficients  for  the  other  effects  equal  the  coefficient  for  \l. 
Also,  in  the  sums  miaking  up  the  RH>!'s,  onlv  the  sum  of  the  RHM's  for  the 
heterotic  effects  equals  the  sum  for  the  |i  equation.    Since  the  linebreds  are 
not  used  in  estimating  the  general  combining  abilities  or  maternal  abilities, 
and  are  not  involved  in  estlnHting  specific  or  sex-linktj  effects,  several 
cells  in  the  table  contain  no  entries.    Again,  it  is  noted  that  in  the  seg- 
m.ent  of  the  coefficient  matrix  corresrjonding  bv  row  and  column  to  the  general 
combining  ability  effects,  there  are  entrirs  in  the  off -diagonal  elements. 
In  the  segm.ents  corresponding  to  the  other  effects,  onlv  zeros  apnear  in  the 
off-diagonals. 


Imposing  Restrict! ens  on  the  Constant  Estimates 

The  restrictions  imnosed  on  the  constant  estimates  with  this  model  are 
similar  to  those  of  model  (7)  except  that  the  tvoe-of -breeding  effects  and  the 
r*urebred  effects  must  be  taken  into  account.    While  many  restrictions  are 
possible,  the  following  are  convenient  for  this  analysis: 

fah  =  =  ?g2i  =  2^2T»z3i.  =  Zc^i  =  ZZ    Z^a  =  I^^a  =  Z^^^  =  t^a+t^i  =  0. 

h         1  1  ^i^j'-       ij^i  j'' 

These  restrictions  force  the  constant  estimates  for  tiTDe-of -breeding-effects 
to  sum  tc  zero  about  }1    the  constant  estimates  for  nurebred  effects  to  sum  to 
zero^about  p,  +  a^,  and  the  remaining  constant  estimates  to  sum  to  ze^'O  about 
jj.  +  The  rf^  strict  lens  on  the  constant  estimates  for  the  a^  and  Vi±± 

effected  b-^'  subtraction  bv  row  and  column  as  r'iscussed  in  the  first  section. 
The  remaining  restrictions  are  imposed  bv  the  same  procedures  as  outlined  for 
model  (7). 


Inversion  of  the  Reduced  Matrix 

After  the  restrictions  discussed  above  have  been  im'posed  on  the  constant 

estimates^  there  will  remain  a  symmetric  matrix  of  order  d^,  viriiere  d  -  the 

number  of  lines  included.    Inversion  of     matrix  of  this  order  is  of  course 
m.ost  readily  accomplished  on  a  high  sr>eed  corrouter. 
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Computing  the  Least-Squares  Means  and  Standard  Errors  (Fixed  Model) 


The  constant  estiirates  for  the  least-squares  mean  and  all  effects  in- 
cluded in  the  model  can  be  obtained  bv  solving  the  equations  together  with  the 
inversion  of  the  variance-covariance  iratrix.    If  this  is  not  Hone,  then  the 
constants  can  be  estiirated  from  the  inverse  elements  and  the  RHM's  of  the 
equations  by 

2  CijYj  =  Ci 

where  C^j  is  the  inverse  element  for  the  ith  row  and  j"th  column  of  the  matrix 
inverse,  Y-j  is  the  RHM  for  the  J^h  row  and  ci  is  the  estimate  of  the  ith  con- 
stant.   Since  the  constant  esti-nate  for  the  t-'TOe-of-breeding  effect  is  deter- 
mined as  a  deviation  from  p.,  the  least-squares  mean  for  the  nurebreds  is 
given  by 

+  ai 

and  the  least-squares  m.ean  for  lirecrosses  or  crossbreds  is  gjven  by 
jl  +  a2 

The  least-squares  mean  for  a  purebred  grcuD  is  then 

a  +  ai  +  T^i±± 
and  the  least-squares  m:ean  for  a  linecross  group  is 

(I  +  a2  +  g2i  +  g2j      ^3      ^2ij      ^2ij  . 
The  standard  errors  for  these  means  are  r<eterm.ined  as  follows: 


(ui+a2+g2i+g2o+^2j-^C2ij+r2ij) 

+  CC2ijC2ij  +  (52ijr2i3  ^  2C^^2  +  2CM-g2i  +  2C^^2^  +  2C^^''2j  +  20^^21^ 

+  20^2io  +  2C^2g2i  +  2C^2g2j  +  20^2^21  ^  2C^2C2ij  +  2C^2r2ij 

=  2Cg2ig2j  +  2Cg2im2j  +  2Ce2iCij  +  2CS2i*'2ij  +  20^2  3^12  j  +  2Cg2jC2ij 


+  2Ce2jmj  +  2C"'2jC2ij  ^  2C^2jmj  2C<^2ijr2ijy 


where  the  C's  denote  elements  of  the  rnatrtx  inverse  to  the  variance-covariance 
matrix,  the  superscripts  designate  the  particuler  elements  and  Oq  is  the  error 
standard  devj.atlon«,    VJhile  the  standard  errors  for  the  over-all  mean,  pure- 
breds ,crospbreds  and  purebred  groups  can  be  obtained  readily,  it  is  obvious 
that  considerable  effort  is  involved  in  obtiiining  the  standard  errors  for  the 
crossbred  groups  by  the  use  of  the  forTrrala  given  above,    A  shorter  method  of 
obtaining  these  constants  will  be  presented  later. 

The  error  variance j  a|,  is  normally  obtained  by  the  following  formula 


2ZZZ  ygiik  -  R(!i.,a,g,m,Cjr) 

=  „ 

d.f.  for  error 


where  2ZZZ  v^^^  =  the  total  uncorrected  sum  of  squares,  and  R(M.,qsg>"^jCar)  " 

hijk 

the  total  reduction  in  sum  of  squares  due  to  fitting  all  constants.    The  total 
reduction  in  sum  of  squares  is  obtained  by  the  same  procedures  as  presented 
for  model  (7). 


Cojnputing  Sums  of  Squares  for  the  Analysis  of  Variance 

The  sums  of  squares  are  obtained  by  the  procedures  discussed  previously, 
appl-\'i.ng  the  formula 

B'Z"^  B 

where  B'  is  a  row  vector  of  the  constant  estiirates  for  the  effect  considered, 
B  is  a  column  vector  of  these  constant  estimates,  and  Z  is  the  segment  of  the 
matrix  inverse  to  the  variance-covariance  matrix  corresponding  by  rows  and 
columns  to  the  particular  set  of  effects.    The  sums  of  squares  may  also  be 
obtained  by  differences  in  reductions* 

The  breakdown  of  the  degrees  of  freedom  for  the  analysis  under  model  8 

are 


Source  of  Variation  Degrees  of  Freedom 

Total  n...  »  1 

Heterosis  1 

Among  pur eb reds  p-1 

General  combining  abilities  p-1 

Maternal  abilities  p-1  ^ 
Specific  combining  abilities 

Sex-linked  effects  "*  Pl^:!!. 

Error  n.,,  -i 


+1 

.2 
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where  p  is  the  number  of  lines  included  in  the  experiment. 


Construction  and  Apnlication  of  Transformation  Matrices 

The  analysis  of  data  of  the  type  described  by  model  (C)  can  be  greatly 
facilitated  by  the  use  of  a  transformation  matrix.    The  method  of  constructing 
a  transformation  matrix  to  be  presented  here  for  model  (8)  is  a  general  one 
and  can  be  followed  for  other  least-squares  analyses.    As  a  first  step,  the 
least-squares  coefficient  matrix  is  written  out  following  the  equations  de- 
scribed for  this  model,  considering  only  one  observation  in  each  of  the  p^ 
subclasses.    Imposing  the  suggested  restrictions  on  the  constant  estimates  re- 
sults in  the  reduced  least-squares  coefficient  matrix  presented  in  Table  21. 


Table  21,    Reduced  least-squares  coefficient  matrix  with  one 

observation  per  subclass  for  model 
■       Tbijk  =  H  +  aji  +  piii  +  g2i  +  g2j  ^  '»2j+C2ij     ^2ij  +  ©hijlc 
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This  matrix  is  inverted  rear'ily  since  each  of  the  five  sm.all  segments  can  be 
inverted  separatelv  to  yield  the  matrix  inverse  shown  in  Table  22.    The  next 
step  is  to  write  out  algebraically  the  subclasses  ^ich  are  used  in  measuring 
each  of  the  effects  included  in  the  model,    Imnosing  the  appropriate  re- 
strictions on  these  valuer,  the  following  RHM's  are  obtained: 

M-.  ZZsij 


^111- 

sil  - 

S22  - 

P133- 

S33  - 

g21* 

i 

+  Zsii,) 

g22* 

(ZS2J 

>  Zsi2) 
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1 

g23* 

(f33 

"  pi3^ 

+  Zsi^.) 

m2i: 

?Hl  ■ 

1 

■  l^ih 

^22* 

fi2- 

- 

m23  5 

f  13  • 

•  iHh 

Table" 22.    Matrix  inverse  to  least-scnares  coefficient  matrix 
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^212'  si2  -  ^i.h  '  %2  -  ^21      ^2h  *  %1  . 
^213'  ^13  ■  ^Ih  '  ^31  "  ^31'  "  ^Iil  "  %3 
^223'-  ^23  -..^2h  -  S32  +  ^31  "  ^)i2  - 

where  Sj_j  refers  to  tne  subclass  for  the  prog-eny  resulting  from  a  mating  of  a 
sire  of  the  i^h  x^^e  with  a  dam  of  the  jth  line. 

The  third  and  final  step  is  the  multiolication  of  the  RHI-I's  given  above 
by  the  natrix  ii, verse  to  the  least-squares  coefficient  matrix  shovm  in 
Table  22,  as  f oIIohs : 


+  1 
21? 

(SZSi. 
i=j 

-  ?^Sij) 

_  3 

2l? 

2U  ic  ^ 
i*j 

ai:   1  ZZsi^ 
2h  ij  ^ 

+  1 
12 

(ZZs^. 

-  ZZsi-?) 

=  3 
2I; 

11 

-   1  ZZsi^ 
W  ij  ^ 

^223*  T  (sio-sii^-si,2-S2i+S2L+Ei,i)  -|  (s^3-Si[,-S3^*S3^  ^s^^^^-s^^) 

-  I  (S23-S2I,.S32H-S3,.s,,2  3^3)  =  |  ^12'  |  ^13"  |  ^21*  I  ^23"  |  ^2l 
"  I  ^^31-  r  '32^  I  ^3h'  l^h2  -  I  s,,3 

The  values  resulting  from  the  above  multiDlicaticns  are  the  coefficients 
for  the  transformation  mi^trix,  K,  which  is  shcwm  in  Table  23.    If  r*  xs  the 
least-squares  coefficient  matrix  for  the  subclasses,  then  the  matrix  ..nverse, 
C-lj  to  th-?  least-scua-^e  coefficient  matrix  for  m^del  (8)  is  obtained  by 

KD-1  K' 

where  K  is  f'.e  transformation  matrix,  D~l  the  matrix  inverse  ©f  D,  and  K*  i«  th« 
4)rftnsposeof  t^trar'sfhniHtronmatrbc.   Since  Lis  a  diagonal  >itr.ix,its  inverse  can  be 
obtained  quicklv  by  calculating  the  reciorocal  of  each  diagonal  element.  The 
constant  estimates  are  computed  by 

KS  B 

where  S  is  a  colainn  vec'.or  of  the  subcI..tS5  means  and  B  is  t!  e  column  vector 
of  the  constant  esti rates. 


Table  23,  Trsn"  format  ion  matrix  for  nradel.  yii±jk=^*^h'^Vl±±*S2i'*'i2y^2y^'^± 
+r2j_j+ev^-|j^:.  Purebred  effects  estimated  indeDendently  of  general  combining 
ability  and  maternal  ability  effects. 
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0_ 

P        It  can  be  seen  that  if  a  simole  transforation  matrix  is  available  for 
'the  rrodel  desired,  a  great  deal  of  labor  can  be  saved  in  this  type  of  an- 
alysis.   VJhlle  rrany  transfcrm.aticn  matrices  can  be  written  readily  vjithout 
using  the  orocedure  outlined  above,  those  for  more  complex  designs  can  be  ob- 
tained in  this  manner.    As  had  been  pointed  out  nreviouslv,  transformation 
matrices  are  also  useful  when  making  orthogonal  comparisons  among  the  con- 
stant estimates  of  treatment  effects,  where  one  is  concerned  with  cnlv  a  seg- 
msnt  of  the  entire  matrix  inverse,  and  only  those  estirrates  of  the  constants 
for  the  treat^rent  effects  under  consideration. 


Numerical  Examnle 

The  data  used  in  the  numerical  examole  are  those  used  nreviouslv  for 
model  (7),  with  the  ad.'itinn  of  the  linebreds,  which  were  included  in  the  ex- 
periment but  omitted  for  the  previous  exairnle. 


Least-Squares  Anal3rsis 

The  least  squares  ecuations  are  shown  in  Table  2h  where  it  can  be  seen 
that  onlv  the  segment  corresoonding  bv  row  and  column  to  the  general  com- 
bining ability  effects  contains  entries  in  the  off -diagonals.    All  other  seg- 
ments corresponding  b\'  row  and  column  to  one  of  the  effects  being  estimated 
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contain  entjt^s  in  the  dia£onals  onlv.    It  can  also  be  observed  that  the  sum 
of  the  coefficients  for  the  general  combining  ability  effects  (in  all  equa- 
tions in  which  there  are  entries)  equals  twice  the  coefficient  for  [l,  except 
in  the  |i  equation  due  to  inclusion  of  ourebreds  in  the  analvsis. 

After  imrjosing  the  restrictions  on  the  constant  estiinates  suggested  for 
this  model,  the  coefficient  matrix  can  be  inverted  and  the  ixiverse  elements 
necessarv  for  estimating  constants ^  standard  errors,  and  calculating  sums  of 
squares  for  the  analysis  of  variance  can  be  obtained.    The  constants  may  be 
estimated  during  the  process  of  inversion. 

VJhen  a  transfornaticn  matrix  is  available  the  inverse  elements  and  the 
constant  estim.ates  c?n  be  obtained  FiOre  readil^"  than  by  the  inversion  of  the 
least-squares  coefficient  matrix.    Using  the  transformation  i^trir  whlcr  has 
been  r)resented  for  this  model,  the  desired  matrix  inverse  to  the  least- 
squares  coefficient  matrix  can  be  calculated  by  means  of  a  desk  calcula+or  in 
a  verv  short  time.    The  constants  estimates  are  also  comouted  simnlv  and 
quickly. 

Since  the  same  observations  were  utilized  in  estimating  the  general  com- 
bining abilitv,  maternal,  spe':;ific  combining  abilitv,  and  sex-linked  effects 
as  in  the  previous  m.odel,  the  inverse  elements  pertaining  to  these  effects 
and  the  constant  estimates  are  the  same  as  vere  obtained  in  the  previous  an- 
alysis, within  rounding  error.    With  the  inclusion  of  the  linebreds  in  this 
analvsis,  additional  constant  estimiates  are  obtained  for  heterotic  and  pure- 
bred effects.    The  constant  estimates  resulting  from  the  analysis  under 
model  (8)  are  shown  below s 

Si3  -  -.172 
^12  "  '766 
ri3  -  .982 
r23  -  1.229 


The  remaining  constant  estimates  (pi[j[j,  g2^,  etc.)  can  be  obtained  bv  con- 
sidering the  restrictions  that  were  imDOsed  on  the  estimates  and  m.aking  the 
approoriate  additions  and  subtractions.    Due  to  the  size  of  the  matrix  in- 
verse, this  is  not  presented. 

The  total  reduction  in  sums  of  squares  is  calculated  bv  the  usual  pro- 
cedures of  obtaining  the  sum  of  the  products  of  the  constant  estimates  by  the 

corresponding  RHK's 

R(p.,a,p,g,m,c,r)  =  779, 783. 

This  value  can  also  be  obtained  by  calculating  the  uncorrected  sums  of  squares 
for  the  subclasses. 

The  error  sum  of  squares  is  then  calculated  as  the  difference  between  the 
total  uncorrected  sum  of  squares  and  the  reduction  due  to  fitting  all 


il  -  60.322 

-  -1.057 

Pill  -  -3.310 

P122  "  7.081 

P133  -  -5.h73 

hi  -  2.9hl 


g22  -  -1.293 

g23  -  2.753 

m2i  "  -1.6M 

^22  ■  1.'j26 

rn23  "  -^•78U 

Ci2  -  -1.285 
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constants, 

Error  suin  of  squares  =  793,m.89  -  779,783. lil)  =  13,328. 

The  suns  of  squares  for  heterosis,  r>urebreds,  general  coiribining  ability, 
in8.ternal,  specific  combining  abilitv,  and  sex-lirked  effects  are  obtained  by 
the  procedures  previously  outlined.     Completion  of  i/hri-se  sums  of  squares  re- 
sult? ir  the  follcvrLrg  anal^'-sis  of  variance: 

Source  of  Variation  d .f .     Sup:s  of  Squares    i-.eaii  Squares 


Heterosis  (A) 

1 

192.93 

192.93 

Purebred s  (P) 

3 

1878.5c 

626.17 

General  con-tir.irg  abilities(G) 

3 

lOll.i-b 

3U7.15 

I'iaternal  abilities  (K) 

3 

1725.26 

575.C9- 

Sr^ecific  corrbining  abilities(C) 

2 

180.73 

90.36 

Sex-linked  effects  (H) 

3 

267.73 

89.2h 

Error 

192 

13328.1.5 

69.1^2 

In  view  of  the  srr.all  suit;  of  squares  for  the  f^ex-linked  effects,  these 
effects  could  be  eli'dnated  frcni  the  rrcdel  ar.-d  new  estiiTEtes  comnuted  for  the 
reraining  effects.     Should  the  specific  corbining  ability  effects  prove  to  be 
ron-cignificant  after  the  neiv  estimates  have  been  rrade,  these  effects  could 
also  be  elininated  from  the  model  and  more  efficient  estimates  nade  of  the  re- 
maining effects. 

If  standard  errors  are  desired,  the-"-  can  be  computed  from  the  formulas 
previousl'^  presented,  as  follows: 

s[l  =  \/  .CO57899I1    \/  69.1  2  =  (.0761)  (8. 33)  -  .^3 

s(^L+a]^)  =      .0O57C^9l!  +"Too578^9l'.  +  2( .00201.077)  (8.33)  =  (.325)(r.33)  ■ 
=  l.Cl 

s(il+a-,_+Pi2i)  "  1^.00578991:  +  .r057''99):  +  .03^3PP70  +  2(  .0020!;077) 

+  2(-. 00211:8^9)  +  2(-.0021l889j'    (6.33)  =  (.?13)(8.33) 
=  1.77 

The  standard  error  for  a  ]ireci'os?  or  crossbred  group  car  most  easilv  be 
obtained  frorr;  a  cr nsideration  of  the  diagonal  matrix  made  uo  of  the  subclass 
(linebred  and  crossbred  groups)  numbers.    Each  of  these  subclasses  contain 
the  i-i  effect  nlus  subclass  effect,  so  the  inverse  element  from  this  diggonal 
eleiT;ent  corresnoruiirg  to  a  subclass,  Sj_^,  is  the  eler.ent  needed  to  determine 
the  standard  error,  sp_+3.  ^.    To  calculate  the  standard  error  of  the  cross  be- 
tween a  sire  of  the  1st  line  and  a  dam  of  the  2nd  line,  the  subclass  number 
is  12  and  its  inverse  1/^2.    Therefore,  the  standard  error  for  this  crossbred 
or  linebred  group  :  s 

S12  =  /X~  /69II2  =     5.7850  =  2,11 
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The  data  which  have  been  used  in  this  analvsis  coulr  also  have  been  an- 
alyzed using  the  model 


yijk  »  ^  +  8i  +  dj  +  (cd)i^  +  eiji^  

where  71  jk  "  "^^^  observation  on  the  k'tii  offsr)rirg  of  a  nsting  between  the 
darn  and  i"*^'^  sire.    W^dle  an  analvsis  of  this  model    r'oes  not  rdeld  the  de- 
tailed information  of  the  previous  anal^'3is,  it  does  offer  a  measure  of  the 
over-all  performance  of  the  lines  as  sires  and  dam.s.    The  equations  for  an 
analysis  under  this  rodel  woulc  be  written  in  the  usual  wav  for  a  two-wav 
classification  with  interaction.    Again,  the  availabilitv  of  a  transformation 
i-natrix  lessens  the  labor  consic'eralv,    Th-?  transformation  matrix  for  model  (9) 
is  shown  in  Table  2$  for  the  case  of  four  lines. 
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vlelded 

the  following  constant  estlretes: 

il  "  ^0.850          83  -  -1.1.330  (s^)23  -  2.162 

si  -    l.C<3  (8^)11.  -  -5.361  (8'd)3;L  -  2.559 

52  -      .767    <sd)i2*     -'^^^  (sd)32  -  '^^^^ 

53  -   1.1.5U    (sci)i3-     )!.526  (8'^)33  -  -I,.l83 
Si   .   -.5'^^8    (3<i)2i  -  -1.^96 

d2    -    2.193     <«<*>22  "  2.536 

The  remaining  constant  estirr^tes  for  sires,  Hams,  and  interacti.- ns  can  be  ob- 
tained b'^'-  the  usual  nrocediires.    Total  reduction  in  sum  of  squares,  error  sum 
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of  squares,  and  the  sums  of  squares  for  sires,  dairiS,vand  interactions  also 
follow  the  methods  previouslv  outlined.    The  analysis  of  variance  following 
the  comtiletion  of  these  computations  is  shown  below: 


Source  of  Variation      d-.f .      Sums  of  Squares      Mean  Squares 

Sires  (S)                   3  658,1.9  219.50 

Dams  (D)                   3  1L09.62  169.8? 

S  X  D  9  1812.72  201.1-1 

Error  191  13327.55  69.78 

The  significant  interaction  of  sires  and  dams  is  not  unexpected  in  an  ex- 
periment of  this  tvpe  and  makes  the  analvsis  under  model  (9)  of  very  limited 
value.    If  one  is  concerned  only  with  the  particular  crosses  produced  in  the 
experiment,  this  analysis  does  measure  the  value  of  the  lines  as  sires  and 
dams.    However,  there  is  no  information  as  to  what  tyoe  of  gene  action  is  in- 
volved and  so  the  analysis  is  not  of  value  in  form::lating  breeding  olans 
where  one  wants  to  have  information  on  both  additive  and  non-additive  effects, 
such  as  are  brought  out  under  models  (7)  or  (8).    Since  the  interaction  is 
significant  it  is  of  irroortance  to  know  what  contribution  specific  com.bining 
ability  and  sex-linked  effects  have  nade. 


Analysis  of  the  data  under  model  (8)  yielded  an  estimate  of  the  over-all 
amoiint  of  heterosis  and  a  test  of  the  significance  of  this  effect.    The  use 
of  a  simple  transformation  matrix  makes  it  possible  to  obtain  estimates  of  the 
amount  of  heterosis  and  the  inverse  elements  necessary  to  test  the  signifi- 
cance of  the  heterosis  effect  in  each  cross,  and  also  to  make  tests  of  sig- 
nificance among  the  heterosis  estimates.    While  the  constant  estimates  and  the 
tests  of  significance  are  not  independent,  since  the  purebreds  or  linebreds 
are  each  used  three  times,  they  ar«  of  considerable  interest.    The  trans- 
formation matrix  is  shown  in  Table  26  together  with  the  transformed  matrix  in- 
verse of  the  subclass  numbers.    It  can  be  seen  from  the  transformation  matrix 
that  the  constant  estimate  for  heterosis  in  each  cross  is  a  contrast  between 
the  average  of  the  purebreds  and  the  average  of  the  reciprocal  crosses.  The 
ranked  constant  estimates,  with  tests  of  the  significance  of  each  estimate 
and  tests  of  the  significance  among  the  estirrates,  are  shown  below: 


Constant 

Tests  of  Significance 

Cross 

Estimate 

Among 

Crosses 

13 

8.31h** 

Ih 

5.095* 

23 

2.0h7 

3h 

1.175 

12 

.L83 

2h 

-h,Ii31 

The  tests  of  significance  of  the  heterosis  estimate  for  each  cross  was 
made  by  the  analysis  of  variance,  with  a  single  degree  of  freedom  for  each 
estimate.    The  sum  of  squares  for  each  contrast  utilizes  the  constant  es- 
timate and  the  inverse  element  of  the  transformed  matrix  corresponding  by  row 
and  column  to  the  cross  being  considered.    For  example,  the  sum  of  squares 
for  the  estimate  of  heterosis  resulting  froi.:  crossing  line  1  and  line  3  is 


S.Sqs.— ».oTO^  =  885.8!.  . 


Table  26.    Trpnsformation  rr.atrix  used  to  obtain  constant  estinetes  of 
heterosis  effects  for  seoarate  crosses  and  the  transforiried  matrix  inverse  of 
subclass  numbers. 


m 

si2  °13  ^1.  ^21  ' 

'22  ®23  ^2!' 

®31  ®32  ®33 

^31)         ^h2  %3^hl 

"12  * 

-.5 

,5 

0    0   .5  - 

-.5     0  0 

0      0  0 

0  0 

0      0  0 

^13- 

0  . 

,5    0  0 

(!      V,'  yj 

K     n  ^ 

0  0 

0      0  0 

-.5 

0 

0.5  0 

0      0  0 

r\       r\  r\ 
(J       U  U 

0  .5 

0    0  -.5 

^23* 

0 

0 

0      0      0  - 

C  c' 

-.5    .5  0 

n  c" 

0    .5  -.5 

0  0 

0    0  0 

^2li- 

0 

0 

0      0      0  - 

0      0  0 

0    0  , 

.5    0  -.5 

0 

0 

0      0  0 

0      0  0 

0    0  -.5 

.5  0 

0   .5  -.5 

hi  2 

^13 

hlh 

h23 

^2h 

h23 

.0626!.  569 

.0113636!! 

.011363611 

.00961538 

.00961538 

.nooOOOOO 

hl3 

.0113636)1 

.07^03030 

.OII3636L 

.02083333 

.00000000 

.02083333 

hill 

.01136361 

.01136361i 

.0706871^ 

.00000000 

.02083333 

.02083333 

h23 

.00961538 

.02083333 

.00000000 

.07628205 

.00961538 

.02083333 

^2h 

.00961538 

.oorooooo 

.02083333 

.00961538 

.08669872 

.02083333 

^31 

.00000000 

.02083333 

.02083333 

.02083333 

.02083333 

.08333333 

The  analysis  of  variance  is  shown  below: 

Source  of  Variation      d .f .      Sum  of  S  juares      Mean  Square 


Heterosis 


Line 

1 

and 

line 

2 

1 

3.72 

3.72 

Line 

1 

and 

line 

3 

1 

885.93 

P85.93^ 

Line 

1 

and 

line 

L 

1 

366.16 

366.16-;;- 

Line 

2 

and 

line 

3 

1 

51.93 

51.93 

Line 

2 

and 

line 

I. 

1 

226.51 

226.51 

Line 

3 

and 

line 

h 

1 

16.57 

16.57 

Error 

191 

13327.55 

69.78 

The  tests  of  significance  among  the  constant  estimates  for  the  separate 
crosses  were  made  by  means  of  Duncan's  Mtir»le  Range  Test,  the  details  of 
which  have  been  presented  previously. 


sII■mTA^:Eous  at'al^is  of  ^ata  fpoi'  crosses  and  PUPEEPEDS  II 

The  analysis  of  the  data  under  model  (8)  estimated  the  purebred  effects 
independently  of  the  general  combining  abilitv  effects.    The  purebred  effects 
can  also  be  estimiated  by  a  simul  taneous  considei'ation  of  the  general  com- 
bining ability  and  maternal  effects  by  including  these  effects  in  the 
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equations  for  the  purebred  effects,    T^is  method  of  analysis  results  in  the 
saine  constant  estimates  and  sums  of  soiiares  for  all  effects  other  than  the 
purebreds. 


Model 

The  mathematical  model  describing  this  third  method  of  analysis  is: 

yhijk  =  H+ah+Piii+g.i+g.j+m.j+C2ij+r2ij+ehijk  .     •     •     •  O-O) 

where  the  s^nnbol-  represent  the  same  effects  as  those  for  model  (8).    The  sub- 
scripts indicate  a  siimltaneous  ccnsideratif n  of  the  general  combining  ability, 
maternal,  and  *nurebred  effects.  The  least-sojjares  eouaticns  for  model  (10) 
are  presented  in  Table  27, 


CoinDletic;.  of  Analysis 

The  restricts ons  imposed  on  the  constant  estimates  are  the  same  as  those 
for  model  (C)  and  the  remainder  of  the  calculations  proceed  by  the  same 
methods  as  outlined  for  models  (7)  and  (8).    No  numerical  exariTole  is  vve- 
sented  for  this  m.odel  due  to  its  similarity  to  model  (8), 
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